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Abstract 

This article is about Artin's braid group B„ and its role in knot theory. We set our- 
selves two goals: (i) to provide enough of the essential background so that our review 
would be accessible to graduate students, and (ii) to focus on those parts of the subject 
in which major progress was made, or interesting new proofs of known results were 
discovered, during the past 20 years. A central theme that we try to develop is to show 
ways in which structure first discovered in the braid groups generalizes to structure in 
Garside groups, Artin groups and surface mapping class groups. However, the liter- 
ature is extensive, and for reasons of space our coverage necessarily omits many very 
interesting developments. Open problems are noted and so-labelled, as we encounter 
them. A guide to computer software is given together with a 10 page bibliography. 
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1 Introduction 



In a review article, one is obliged to begin with definitions. Braids can be denned by very 
simple pictures such as the ones in Figure ^ Our braids are illustrated as oriented from left 
to right, with the strands numbered 1,2, ... ,n from bottom to top. Whenever it is more 
convenient, we will also think of braids 'vertically', i.e., oriented from top to bottom, with 
the strands numbered 1,2, ... ,n from left to right. Crossings are suggested as they are in a 
picture of a highway overpass on a map. The identity braid has a canonical representation in 
which two strands never cross. Multiplication of braids is by juxtaposition, concatenation, 
isotopy and rescaling. 




X Y XY 



Figure 1: Examples of 4-braids X, Y and their product XY 

Pictures like the ones in Figure ^ give an excellent intuitive feeling for the braid group, 
but one that quickly becomes complicated when one tries to pin down details. What is 
the ambient space? (It is a slice R 2 x I of 3-space.) Are admissible isotopies constrained 
to M 2 x 11 (Yes, we cannot allow isotopies in which the strands are allowed to loop over 
the initial points.) Does isotopy mean level-preserving isotopy? (No, it will not matter 
if we allow more general isotopies in M? x /, as long as strands don't pass through one- 
another.) Are strands allowed to self-intersect? (No, to allow self-intersections would give 
the 'homotopy braid group', a proper homomorphic image of the group that is our primary 
focus.) Can we replace the ambient space by the product of a more general surface and an 
interval, for example a sphere and an interval? (Yes, it will become obvious shortly how to 
modify the definition.) 

We will bypass these questions and other related ones by giving several more sophisti- 
cated definitions. In fcll.ll £11.21 and 81.31 we will define the braid group B n and pure braid 
group P n in three distinct ways. We will give a proof that two of them yield the same 
group. References to the literature establish the isomorphism in the remaining case. In 
81.41 we will demonstrate the universality of 'braiding' by describing four examples which 
show how braids have played a role in parts of mathematics which seem far away from knot 
theory. 

1.1 B n and P n via configuration spaces 

We define the topological concept of a braid and of a group of braids via the notion of a 
configuration space. This approach is nice because it gives, in a concise way, the appropriate 
equivalence relations and the group law, without any fuss. 

The configuration space of n points on the complex plane C is: 

Co,h = C Q)ft (C) = {(z 1 , . . . , z n ) e C x . . . x C | Zi / Zj if % / j}. 
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A point on Co,n is denoted by a vector z = (z%, . . . , z n ). The symmetric group acts freely 
on Co,n, permuting the coordinates in each z S Co,n- The orbit space of the action is 
Co,n = Co,n/^n and the orbit space projection is r : C ,n — > Co,n- Choosing a fixed base 
point p = (pi, . . . we define the pure braid group P n on n strands and the braid group 
B n on n strands to be the fundamental groups: 

P n = 7Ti(C 0) ft,p), B n = 7ri(C ,„,r(p)). 

At first encounter 7ri(Co n , doesn't look as if it has much to do with braids as 

we illustrated them in Figure ^ but in fact there is a simple interpretation which reveals 
the intuitive picture. While the manifold Co,n has dimension 2n, the fact that the points 
zi, . . . ,z n are pairwise distinct allows us to think of a point z G Co,n as a set of n distinct 
points on C. An element of 7Ti(Co,n> t(p)) is then represented by a loop which lifts uniquely 
to a path g : I — > Cq^, where g = (gx, . . . ,g n ) consists of n coordinate functions gi : I — > C 
satisfying gi{t) ^ 9j(t) if i ^ j, t £ [0, 1], also g(0) = g(l) = p, the base point. The graph 
of the n simultaneous functions gx,...,g n is a geometric (pure) braid. The appropriate 
equivalence relation on geometric braids is captured by simultaneous homotopy of the n 
simultaneous paths, rel their endpoints, in the configuration space. 

The group B„ is the group which Artin set out to investigate in 1925 in his seminal 
paper |B] (however he defined it in a less concise way); in the course of his investigations 
he was led almost immediately to study P n . Indeed, the two braid groups are related in 
a very simple way. Let r* be the homomorphism on fundamental groups which is induced 
by the orbit space projection r : Co,n - ► Co,n- Observe that the orbit space projection is a 
regular n!-sheeted covering space projection, with T, n as the group of covering translations. 
From this is follows that the group P n is a subgroup of index n\ in B n , and there is a short 
exact sequence: 

1 — ► P n ^^B n — ► S n — ► 1 (1) 
1.2 B„ and P n via generators and relations 

We give two definitions of the group B ra by generators and relations. The classical presentation 
for B n first appeared in jSj. We record it now, and will refer back to it many times later. 
It has generators ax,... , o n -\ and defining relations: 

CjCfc = CfcCj if \i — A; | > 2, aia i+ iai = ai+x<?i<?i+\- (2) 

The elementary braid <7j is depicted in sketch (i) of Figure [21 

Many years after Artin did his fundamental work, Birman, Ko and Lee discovered 
a new presentation, which enlarged the set of generators to a more symmetric set. Let 
<r S)t = (cr t _i---cr s+ i)(7 s ((T^ 1 ---(7^ 1 1 ), where 1 < s < t < n. We define a s j = at,s, and 
adopt the convention that whenever it is convenient to do so we will write the smaller 
subscript first. The new presentation has generators {cr Sj j, 1 < s < t < n} and defining 
relations: 

cr s ,tcr q ,r = cr q ,rcr s ,t if (* - r)(t - q)(s - r)(s - q) > 0, (3) 

<7 S ,t(7r,s = &r,t<7s,t = <?r,s<Tr,t if l<r<S<t<Tl. 
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Figure 2: (i) The elementary braid <Xj. (ii) The elementary braid a s (hi) The pure braid 
A 8 ,t = A Sjt = a 2 st . 

See sketch (ii) of Figure [21 for a picture of a S) t, and :23 for a proof that (J2J) and © define the 
same group. Both presentations will be needed in our work. Note that the new generators 
include the old ones as a proper subset, since Oi = er^j+i for each i = 1, 2, . . . , n — 1. 

By (dJ the pure braid group P n has index ti! in B n . Let j4 Sj £ — Ai tS — c^j- (See sketch 
(iii) of Figure |2J). The symmetry A s j = At. s can be seen by tightening the s th strand at 
the expense of loosening the t th strand. It is proved in [§] and also in |65j that P n has a 
presentation with generators A r<s , 1 < r < s < n and defining relations: 



A r s AijA r>s = Aij if l<r<s<i<j<n or l<i<r<s<j<n 
= A r jAijA~j if 1 < r < s = i < j < n, 

= (AijA^Ai^AijAaj)- 1 if l<r = i<s<j<n, 

= (A r j, A S jA~jA~j)A it j(A ri j, AgjA'jAjj)" 1 if 1 < r < i < s < j < $4) 

The relations in come from the existence of a split short exact sequence, for every 
k = 2, . . . , n: 

{1} - F n _! - P n ^ P n _! -» {1}. (5) 

The map -zr* is defined by pulling out the last braid strand, and the image of P n _i under 
its inverse embeds P n _i in P n , as the subgroup generated by pure braids on the first n — 1 
strands. The free subgroup F n _i is generated by the braids A\ jTl , A2 >n , ■ ■ ■ , A n -\ jTl . The 
pure braid group P2 is infinite cyclic and generated by A\^- Inducting on n, the structure 
of P n via a sequence of semi-direct products is uncovered. 



1.3 B„ and P n as mapping class groups 

Our announced goal in this review was to concentrate on areas where there have been new 
developments in recent years. While it has been known for a very long time that Artin's 
braid group is naturally isomorphic to the mapping class group of an n-times punctured 
disc, people have asked us many times for a simple proof of this fact. We do not know of 
a simple one in the literature, therefore we present one here. In this case 'simple' does not 
mean intuitive and based upon first principles, rather it means using machinery which is 
normally available to a graduate student who has the tools learned in a first year graduate 
course in topology, and is preparing to begin research. 



Let 5 = Sg t b, n denote a 2-manifold of genus g with b boundary components and n 
punctures, and let Diff + (5) denote the groups of all orientation preserving diffeomorphisms 
of 5. Observe that we may assign the compact open topology to Diff + (5), making it into a 
topological group. The mapping class group M. = M. g ,b,n of 5 is 7ro(Diff + (5)), that is, the 
quotient of Diff + (S') modulo its subgroup of all diffeomeorphisms of 5 which are isotopic 
to the identity rel dS. We allow diffeomeorphisms in Diff + (5) to permute the punctures, 
writing DifF 1 " (Sg^.h) if they are to be fixed pointwise. Our interest in this article is mainly 
in the special case of A^o,l,n- 

Theorem 1 There are natural isomorphisms: 

B n = M Q ,i, n and P n M ,i,n 

Proof: We begin with an intuitive description of how to pass from diffeomorphisms to 
geometric braids and back again. Choose any h G Diff + (5o,i, n )- While h is in general 
not isotopic to the identity, its image i(h) in DifF 4 " (5*0.1,0) under the inclusion map i : 
Diff + (5o,i, n ) - * Diff + (5o,i,o) is, because Diff + (5o,i,o) = {!}• Let ht denote the isotopy. If 
the punctures in 5o,l,n are at (pi,P2, ■ ■ ■ ,Pn), then the n paths (h t (pi), h t (p2), • • • , ht(p n )) 
defined by the traces of the points (pi,P2, ■ ■ ■ ,Pn) under the isotopy sweep out a braid in 
5o,i,o x [0, 1], and the equivalence class of this braid is the image of the mapping class [h] 
in the braid group B n . 

It's a little bit harder to understand the inverse isomorphism, from the braid group to 
the mapping class group. One chooses a geometric braid and imagines it as being located in 
a slice of 3-space, with the bottom endpoints of the n braid strands (which are oriented top 
to bottom) as being pinned to the distinguished points pi,...,p n on the punctured disc. If 
one is very careful the n braid strings can be laid down on the punctured disc so that they 
become n non-intersecting simple arcs, each of which begins and ends at a base point. One 
then constructs a homeomorphism of the punctured disc to itself in such a way that the 
trace of the isotopy to the identity is the given set of n non-intersecting simple arcs. 

To prove the theorem, we begin by establishing the isomorphism between P n and -Mo,i,n- 
A good general reference for the underlying mathematics is Chapter 6 of the textbook [HE] • 
As previously noted Diff + (S'0,1,0) is a topological group. Also, Diff + (5o,i,r0 is a closed 
subgroup of Diff + (5*0,1,0) • The evaluation map £ : DifF 1- (S'0,1,0) — ► Co,n is defined by 
£(h) = (h(pi), . . . , h(p n )). It is clear that £ is continuous with respect to the compact 
open topology on Diff + (5o,i,n) and the subspace topology for Co,n C C x C x • • • x C. The 
topological group DifF" (5o,i,o) acts n-transitively on the disc in the sense that if (pi, . . . ,p n ) 
are n distinct points and (w±, . . . ,w n ) are n others then there is an h G Diff + (5o,i,o) such 
that h(pi) = Wi, i = 1, ... ,n. Observe that if h G Diff + (5o,i,n) then (h(p\), . . . , h{p n )) = 
(pi, . . . ,p n ) and if h, h! G Diff + (5o,i,o) with £(h) = £(h'), then h, h! are in the same left coset 
of Diff + (5o,i,n) in L)iff + (5o,i,o)- In this situation it is shown in |127j . part I, Sections 7.3 and 
7.4, that the 3-tuple (£, Diff + (5o,i,o), Co,n) is a fiber space, with total space Diff + (5o,i,o), 
base space Co,™, projection £ and fiber Diff + (5o,i,n)- (It is a good exercise for a graduate 
student to prove this directly by constructing the required local product structure in an 
explicit manner.) The long exact sequence of homotopy groups of a fibration then gives the 
following exact sequence of groups and homomorphisms, where we focus on the range that 
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is of interest: 



. . . - vr 1 (Diff+ (.So.i.o)) ^ 7ri(Co >A ) ^ 7ro(Diff+(S 0)M )) ^ 7r (Diff+(S 0) i,o)) h . . . 

The two end groups are trivial. The left middle group is P n and the right middle group is 
•Mo in- The isomorphism of Theorem ^ is <9*. Tracing through the mathematics one finds 
that in fact its inverse is the map that we described right after we stated the theorem. The 
assertion about P n is therefore true. The proof for B n can then be completed by comparing 
the short exact sequence (^Q) , which says that B n is a finite extension of P n with quotient 
the symmetric group with a related short exact sequence for the mapping class groups: 

1 M Qi ifi-±+M Q ,i, n - £„ — 1 (6) 

Thus -Mo,i, n is a finite extension of A4o,l,h with quotient the symmetric group E n . Com- 
paring corresponding groups in the short exact sequences (0) and ©, we see that the first 
two and last two are isomorphic. The 5-Lemma then shows that the middle ones are too. 
This completes the proof of Theorem [\]\\ 



1.4 Some examples where braiding appears in mathematics, unexpect- 
edly 

We discuss, briefly, a variety of examples, outside of knot theory, where 'braiding' is an 
essential aspect of a mathematical or physical problem. 

1.4.1 Algebraic geometry 

Configuration spaces and the braid group appear in a natural way in algebraic geometry. 
Consider the complex polynomial 

(X - zi)(X -z 2 )...{X- z n ) =X n + aiX™" 1 + . . . + a n ^X + a n 

of degree n with n distinct complex roots z%, . . . ,z n . The coefficients ai, . . . , a n are the 
elementary symmetric polynomials in {zi, . . . , z n }, and so we get a continuous map C n — > C n 
which takes roots to coefficients. Two points have the same image if and only if they differ 
by a permutation, so we get the same identification as in the quotient map r : Co,n — * Co,n, 
in quite a different way. Since we are requiring that our polynomial have n distinct roots, 
a point {ai, . . . , a n } is in the image of z under the root-to-coefficient map if and only if the 
polynomial X n + a\X n ~ l + . . . + a n has n distinct roots, i.e. if and only if its coefficients 
avoid the points where the discriminant 

A=nc*-*i) 2 > 

i<j 

expressed as a polynomial in {a±, . . . , a n }, vanishes. Thus Co, n (C) can be interpreted as the 
complement in C n of the algebraic hypersurface defined by the equation A = 0, where A 
is rewritten as a polynomial in the coefficients a\, . . . ,a n . (For example, the polynomial 
X 2 + a\X + a2 has distinct roots precisely when a\ — 4<i2 = 0). In this setting the base point 
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r(p) is regarded as the choice of a complex polynomial of degree n which has n distinct 
roots, and an element in the braid group is a choice of a continuous deformation of that 
polynomial along a path on which two roots never coincide. There is a substantial literature 
in this area, from which we mention only one paper, by Gorin and Lin [Slj. We chose it 
because it contains a description of the commutator subgroup H' n of the braid group, and 
many people have asked the first author for a reference on that over the years. While there 
may be other references they are unknown to us. 

1.4.2 Operator algebras 

Our next example, taken from the work of Vaughan Jones jEZMEEl; is interesting because it 
shows how 'braiding' can appear in disguise, so that initially one misses the connection. We 
consider the theory, in operator algebras, of 'type Hi factors', ordered by inclusion. Let M 
denote a Von Neumann algebra, i.e. an algebra of bounded operators acting on a Hilbert 
space h. The algebra M is called a factor if its center consists only of scalar multiples of the 
identity. The factor is type Hi if it admits a linear functional, called a trace, tr : M — > C, 
which satisfies the following three conditions: (i) tr(xy) = tr(yx) V x, y £ M, (ii) tr(l) = 1, 
and tr(xx*) > 0, where x* is the adjoint of x. In this situation it is known that the trace is 
unique, in the sense that it is the only linear functional satisfying the first two conditions. 
An old discovery of Murray and Von Neumann was that factors of type Hi provide a type of 
'scale' by which one can measure the dimension of h. The notion of dimension which occurs 
here generalizes the familiar notion of integer-valued dimensions, because for appropriate 
M and h it can be any non-negative real number or oo. 

The starting point of Jones' work was the following question: if M\ is a type Hi factor 
and if Mq C M\ is a subfactor, is there any restriction on the real numbers which occur as 
the ratio A = dim.M (h) /diniM 1 (h) ? The question has the flavor of questions one studies 
in Galois theory. On the face of it, there was no reason to think that A could not take on 
any value in [1, oo], so Jones' answer came as a complete surprise. He called A the index 
| Mi : Mo | of Mq in Mi, and proved a type of rigidity theorem about type Hi factors and 
their subfactors: 

The Jones Index Theorem: A C [4, oo] U {Acos2tt jp\ , where p G Z,p > 3. Moreover, 
each real number in the continuous part of the spectrum [4, oo] and in the discrete part 
{{Acos2ir/p},p S Z,,p > 3} is realized. 

What does all this have to do with braids? To answer the question, we sketch the idea 
of the proof, which is to be found in [S7j- Jones begins with the type Hi factor Mi and 
the subfactor Mq. There is also a tiny bit of additional structure: It turns out that in this 
setting there exists a map e\ : Mi — > Mq, known as the conditional expectation of Mi on 
Mq. The map e\ is a projection, i.e. e\ = e\. His first step is to prove that the ratio A is 
independent of the choice of the Hilbert space h. This allows him to choose an appropriate 
h so that the algebra M2 generated by Mi and e\ makes sense. He then investigates Mi 
and proves that it is another type Hi factor, which contains Mi as a subfactor, moreover 
IM2 : Mi| = I Mi : Mq\ = A. Having in hand another Hi factor M2 and its subfactor Mi, 
there is also a trace on M2 (which by the uniqueness of the trace) coincides with the trace 
on Mi when it is restricted to Mi, and another conditional expectation ei '■ M2 — * M\. 
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This allows Jones to iterate the construction, to build algebras Mi, Mi, ■ ■ ■ and from them 
a family of algebras { J n , n = 1, 2, 3, . . .}, where J n is generated by 1, ex, . . . , & n -\- 

Rewriting history a little bit in order to make the subsequent connection with braids a 
little more transparent, we now replace the projections e^, which are not units, by a new set 
of generators which are units, defining: = te^ — (1 — e&), where (1 — t)(l — i -1 ) = 1/A. The 
g^s generate J n because the e^'s do, and we can solve for the e^'s in terms of the g^'s. So 
Jn = Jn(t) is generated by 1, gi, g n -i and we have a tower of algebras, Ji(t) C Jiit) C 
ordered by inclusion. The parameter t, which replaces the index A, is the quantity now under 
investigation. It's woven into the construction of the tower. The algebra J n (t) has defining 
relations: 

9i9k = 9k9i if \i~k\> 2, 9i9i + i9i = gi+i9i9i+i, gf = (t - 1)^ + t, (7) 

1 + 9i + 9i+l + 9i9i+i + 9i+i9i + +9i9i+i9i = 0. 

Of course there are braids lurking in the background. If we rename the g[s, replacing g^ by 
and declare the a^s to be generators of a group, then the first two relations are defining 
relations in the group algebra CB n . The algebra J n (t) is thus a homomorphic image of the 
group algebra of the braid group. Loosely speaking, braids are encountered in Operator 
Algebras because they encode the way in which each type Hi factor Mj acts on its subfactor 
Mj_i. Braiding is thus involved in defining the associated extensions. We shall see later, 
in ^4.71 that a similar action, via group extensions, can be used to define representations of 
B n . 

To see the connection with knots and links, recall that since M n is type Hi it supports 
a unique trace, and since J n is a subalgebra it does too, by restriction. This trace is known 
as a Markov trace , i.e. it satisfies the important property: 

tr(wg n ) = f(t)tr{w) if w G J n , (8) 

where f(t) is a fixed function of t. Thus, for each fixed value of / the trace is multiplied 
by a fixed scalar when one passes from one stage of the tower to the next, if one does so by 
multiplying an arbitrary element of J n by the new generator g n of J n +i- The Jones trace 
is nothing more or less than the 1-variable Jones polynomial |89j associated to the knot or 
link which is obtained from the closed braid. We will have more to say about all this in 

sua 

1.4.3 Homotopy groups of spheres 

As before, let P n +i denote the pure braid group on n + 1 strands. For each i = 1, . . . , n + 1 
there is a natural homomorphism p, : P n +i — * P n , defined by pulling out the i th strand. 
The group of Brunnian braids is 

BR n+1 = n£" +1 kernel( K ), 

i.e. a braid is in BR n+ \ if and only if, on pulling out any strand, it becomes the identity 
braid on n strands. Brunnian braids have received some attention in knot theory. 

Braids have played a role in homotopy theory for many years, most particularly in the 
work of F. Cohen and his students (see for example [TU]). but during the past few years 
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the connection was sharpened when it was discovered that there is an embedding of a free 
group F n in P n +i with the property that a well-defined quotient of BR n+ i n F n (a little bit 
too complicated to describe here) is isomorphic to TT n+ iS 2 . It remains to be seen whether 
new knowledge about the unidentified higher homotopy groups of spheres can be obtained 
through the methods of |1U| . 

1.4.4 Robotics 

Our fourth example is an application of configuration spaces to robotics. It shows the braid 
group popping up in an unexpected way (until you realize how natural it is). Robots, or 
AGVs (automatic guided vehicles), are required to travel across a factory floor that contains 
many obstacles, en route to a goal position (e.g. a loading dock or an assembly workstation). 
The problem is to design a control system which insures that the AGVs not collide with the 
obstacles, or with each other, and complete the task with efficiency with regard to various 
work functionals. Here is how configuration spaces appear: The underlying space in this 
simple example is the workspace floor X, from which a finite set O of obstacles are to be 
removed. The configuration space of n non-colliding AGVs is then precisely Cq )TI {X — O). 
More generally, X — O is replaced by a finite graph Y , and the the braid group B n by the 
braid group TTi(Co >n (Y)) of the graph. There is a vast literature on this subject; we suggest 
[78] by R. Ghrist, as a starter. 

1.4.5 Public key cryptography 

In this example braids are important for rather different reasons than they were in our 
earlier examples. In our earlier examples the underlying phenomenon which was being 
investigated involved actual braiding, albeit sometimes in a concealed way. In the example 
that we now describe particular properties of the braid groups B n , n = 1,2,3, .. ., rather 
than the actual interweaving of braid strands, are used in a clever way to construct a new 
method for encrypting data. 

The problem which is the focus of 'public key cryptography' will be familiar to everyone: 
the security of our online communications, for example our credit card purchases, our ATM 
transactions, our cell phone conversations and a host of other transactions that have become 
a part of everyday life in the 21 s * century, The basic problem is to encrypte or translate a 
secret message into a code that can be sent safely over a public system such as the internet, 
and decoded at the receiving end by the use of a secret piece of information known only 
to the sender and the recipient, the 'key'. The problem that must then be solved is to 
establish a private key that will be known only to the sender and the recipient, who will 
then be able to exchange information over an insecure channel. In recent years much work 
has been done on certain codes which are based upon the assumption that the word problem 
has polynomial growth as braid index n is increased, whereas the conjugacy problem does 
not. But in §5\ we will review recent work on the word and conjugacy problems in the 
braid groups, and show that such an assumption seems problematic at best. See $3 and in 
particular the discussion in ^5.51 below. 
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2 From knots to braids 



In this chapter we will explore, for the benefit of readers who are new to the subject, the 
foundations of the close relationship between knots and braids. We will first describe the 
straightforward process of obtaining a knot or a link from a given braid by 'closing' the 
braid. This leads us directly to formulate two fundamental questions about knots and 
braids. First, is it always possible to transform a given knot into a closed braid? This 
question will be answered in the affirmative in Theorem [21 first proved by Alexander in 
1928 in [2]. The correspondence between knots and braids is clearly not one-to-one (for 
example, conjugate braids yield equivalent knots), leading naturally to the second question: 
which closed braids represent the same knot type? That question is addressed in Theorem 
BJ first formulated by A. Markov in [104] . which gives 'moves' relating any two closed braid 
representatives of a knot or link, while simultaneously preserving the closed braid structure. 

Together, Theorems [21 and 0] form the cornerstone of any study of knots via closed braids, 
so we feel obliged to prove them. Among the many proofs that have been published of both 
over the years, we have chosen ones that we like but which do not seem to have appeared 
in any of the review articles that we know. The proof that we give of Alexander's Theorem 
is due to Shuji Yamada [136] . with subsequent improvements by Pierre Vogel [135] . The 
algorithm is elementary enough to be accessible to a beginner, and has the advantage for 
experts of being suitable for programming. The proof that we present of Markov's theorem 
is due to Pawel Traczyk 129 . It is relatively brief, as it assumes Reidemeister's well-known 
theorem about the equivalence relation on any two diagrams of a knot, Theorem [31 below, 
building on methods introduced in the proof of Theorem [21 

2.1 Closed braids 

For simplicity, let us begin with a planar diagram of a given geometric braid. To obtain 
a knot or link, one simply 'closes up' the ends of the braid as in Figure El The pre-image 




Figure 3: The operation of closing a braid X to form a closed braid 

in M 3 of the 'center point' shown in Figure under the usual projection map is called the 
axis of the braid. (If one wishes to consider the knot in S* 3 , then we include the point at 
infinity so that the braid axis is an embedded S .) We then orient the resulting knot or 
link in such a way that the strands of the braid are all travelling counterclockwise about 
the braid axis. The knot or link type resulting from performing this operation on a braid 
X is known as the closure of X and will be denoted by b(X). The same notation may also 
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refer to the particular diagram as in Figure |31 

Equivalently, consider a knot K C S 3 . Suppose there exists A = h^S 1 ) where h is an 
embedding and Z is unknotted in S 3 and contained in the complement of K. Suppose 
further that we choose the point at infinity {00} to be in A and, using standard cylindrical 
coordinates (p, 9, z) on R 3 , identify the resulting copy of E = A — {00} with the z-axis 
in M 3 = S 3 — {00}. If we always have d6/dt > as we travel about the knot K with an 
appropriate cylindrical parametrization, then we say that K is a closed braid with respect 
to the axis A. The closed braid diagram of Figure El is then obtained by projection parallel 
to the direction defined by A onto a plane that is orthogonal to A. 

2.2 Alexander's Theorem 

As we just observed, it is a simple matter to obtain a knot or link from a braid. The classical 
theorem of J. Alexander allows us to reverse this process, though not in a unique way: 

Theorem 2 (Alexander's Theorem |2j) Every knot or link in S 3 can be represented as 
a closed braid. 

Proof: Alexander's original proof was algorithmic, i.e. it gave an algorithm for transform- 
ing a knot or link into closed braid form. While it is straightforward, we do not know of 
any computer program based upon it. We shall give instead a rather different and newer 
algorithm originally due to Yamada |136| . as later improved by Vogel |135| . We like it 
for two reasons: (1) It has a beautiful corollary (see Corollary 12.11 below) which reveals 
structure about knot diagrams that had not even been conjectured by any of the experts 
before 1987, even though there was abundant evidence of its truth; (2) It leads, very easily, 
to an efficient computer program for putting knots into braid form. In this regard we note 
that when Jones was writing the manuscript [HE] > which resulted in his award of the Fields 
medal, he computed closed braid representatives for the 249 knots of crossing number less 
than or equal to 10, constructing the first table known to us of closed braid representatives 
of knots. His list remains extremely useful to the workers in the area in 2004. Yamada's 
work was not yet known when he did that work, and there did not seem to him to be a 
good way to program the Alexander method for a computer, so he calculated them one at 
a time by hand. The amount of work that was involved can only be appreciated by the 
reader who is willing to try a few examples. 

In order to prove Alexander's Theorem, we shall first present the Yamada- Vogel algo- 
rithm for transforming a knot into closed braid form in full, followed immediately by an 
illustrative example (Example 12.11) . We shall then prove Alexander's Theorem by showing 
that it is always possible to perform the steps of the Yamada- Vogel algorithm for any given 
knot or link and that the algorithm always leads to a closed braid. 

The Yamada- Vogel algorithm draws on Seifert's well-known algorithm for using a dia- 
gram of an oriented knot or link K to construct a Seifert surface for K (see |123) or jlOlj . 
e.g., for a thorough treatment of Seifert's algorithm), and we will need some related termi- 
nology. Let C and C be two oriented disjoint simple closed curves in S 2 . Then C and C 
cobound an annulus A. We say that C and C are coherent (or coherently oriented) if C and 
C represent the same element of H\(A). Otherwise we say that C and C are incoherent . 
Following Traczyk |129j . we define the height of a knot diagram D, denoted h(D), to be the 
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number of distinct pairs of incoherently oriented Seifert circles which arise from applying 
Seifert's algorithm to D. The height function gives us a useful characterization of a closed 
braid: a diagram D represents a closed braid if and only if h(D) = 0. (Recall that D lives 
inS 2 .) 

The Yamada-Vogel Algorithm 

1. Let D be a diagram of an oriented knot K. Smooth all crossings of D as in Seifert's 
algorithm to obtain n Seifert circles Ci, . . . , C n . Record each original crossing with a 
signed arc: (+) for a positive crossing (often called a right-handed crossing), (— ) for 
a negative (or left-handed ) crossing (see Figure I2fiv) ) . The resulting diagram is the 
Seifert picture S corresponding to the diagram D. Note that any two circles joined 
by a signed arc in any Seifert picture are necessarily coherent. For an example that 
illustrates the construction of a Seifert diagram, see the passage from the bottom left 
to the top left sketches in Figure |2J 




Figure 4: The Yamada-Vogel algorithm performed on the knot 52- 



2. If h(D) = 0, the knot K is already in closed braid form, and we are done. If h(D) > 0, 
we can find a reducing arc a, i.e., an arc joining an incoherent pair Cj, Cj such that a 
intersects S only at its endpoints. Reducing arcs are illustrated as heavy black arcs in 
the example in Figure EJ A component of S 2 \ S which admits a reducing arc is called 
a defect region. Perform a reducing move along a, as shown in Figure |S1 to obtain a 
new Seifert picture S' in which a pair of coherent Seifert circles, C a and C z , joined 
by two oppositely signed arcs, replaces the incoherent pair Cj, Cj. The corresponding 
move on the original diagram D is a Reidemeister move of type II in which we slide 
Ci over Cj in a small neighborhood of the arc a to obtain a new diagram D' with two 
new crossings. Note that if we instead slide Cj under Cj, we obtain the same two new 
Seifert circles but the signs of the two new signed arcs are now switched. 
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Figure 5: The local picture of a reducing move. 

3. Continue performing reducing moves on incoherent pairs until a diagram with height 
zero is obtained. 

Example 2.1 We apply the Yamada-Vogel algorithm to the diagram of the knot 52, pic- 
tured in Figure ^ (The reader can check that this is the first example in the knot tables 
of a knot diagram with height greater than zero.) The first stage shows the Seifert picture 
associated to the original diagram, consisting of 4 Seifert circles, with 5 signed arcs (all 
positive) recording the original crossings. We see that the original knot diagram has height 
2. The figure shows a choice of reducing arc, a%, joining one of the two pairs of incoherent 
circles. 

In the third sketch of Figure we see the new Seifert picture resulting from performing 
the reducing move along a%. Note that we have introduced two new crossings of opposite 
sign. We also see a new reducing arc, Q2, joining the only remaining pair of incoherent 
circles. We see in the fourth sketch the Seifert picture with height zero resulting from the 
second reducing move performed along 02- At this point, we are done, but in the final 
sketch we see a different planar projection of the same Seifert picture which allows us easily 
to read off a braid word associated to the knot: beginning with the positive signed arc 
in the 'twelve o'clock' position and reading counter-clockwise, we see that the knot 52 is 
equivalent to b(X), where X = o^crf 1 (?2&z l 0-20-^20-^2 ■ 

We may learn several things from this simple example. Applying the braid relations to 
the word defined by X, we see that X = <72&\ l (T 2°'^ 0-20-^20-^2 = o-20~i 1 °~2 ~3 lo ~i ~2 ~i°~30~2 = 
&2&i l (J2V1V3 1 ^2<730"i£T2 = o" 2 o"i (72 o" 3 (T^ GyOi- Since this braid only involves 03 once, 

we may 'delete a trivial loop to get the 8-crossing 3-braid <72<7f ^ci^f^ 10 "! ^, go ^ e a ^g _ 
rithm did not give us minimum braid index. The algorithm also does not give shortest words 
because our 8-crossing braid may be shortened to the 6-crossing 3-braid o^c^o^erfo^. In 
fact, 6 is minimal, because the crossing number of a 3-braid knot must be even, and this 
knot has no diagram with fewer than 5 crossings, so 6 is minimal. So we may deduce one 
more fact: when we use the Yamada-Vogel algorithm to change a knot which is not in closed 
braid form to one which is, the crossing number goes up. A 

To prove Alexander's Theorem, we first need to show that a reducing move strictly 
decreases the height of a diagram. This lemma is sometimes stated as 'obvious' in the liter- 
ature, but the question arises frequently enough to warrant a short but thorough argument. 

Lemma 2.1 Suppose a reducing move is performed which transforms a diagram D to a 
diagram D' . Then h(D') = h(D) - 1. 

Proof. Let C\, . . . , C n be the Seifert circles in the Seifert picture S corresponding to the 
diagram D. Let Ci,Cj be an incoherent pair. The union Ci U Cj separates the 2-sphere 
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into three components: an annulus A cobounded by Cj and Cj, and two disks, T>i and T>j 
bounded by C and Cj, respectively. Suppose that A admits a reducing arc a. A reducing 
move along a preserves circles C p ,p ^ i,j and replaces Cj and Cj with two new circles, one 
of which necessarily bounds a disk T> a containing no other Seifert circles in the new Seifert 
picture S'. We denote this circle by C a (see Figure EJ). The other new circle, denoted C z , 
bounds a disk D z containing all Seifert circles originally contained in the annulus A in S. 

To simplify the bookkeeping, we shall write (C r , C s ) = 1 if the pair C r , C s is coherent, or 
else (C r , C s ) = —1 if C r , C s are incoherent. Obviously, if {p, q} n {i, j} = 0, then (C p , C q ) is 
unchanged by the reducing move, so we need only consider the effect of the reducing move 
on (C p , C x ), where x = i or x = j and p / Now if C p is contained in the annulus A 
in S (and hence in V z in S'), then clearly (C P ,C Z ) = (C p ,C a ) = (C p ,Ci) = (C p ,Cj). Also, 
if C p C Vi in S, then (C P ,C Z ) = {C p ,d) and [C p ,C a ) = (C p ,Cj). Similarly, if C p C Vj 
in S, then (C P ,C Z ) = (C p ,Cj) and (C p ,C a ) = (C p ,Ci). Therefore the number of distinct 
incoherent pairs C r ,C s in S with {r, s} ^ {i,j} is equal to the total number of distinct 
incoherent pairs in S' of the form C r , C s with {r, s} ^ {a, z}. By construction, however, we 
have replaced (C^Cj) = -1 with (C z ,C a ) = 1. Thus h(D') = h(D) - 1. || 

The previous lemma tells us that the Yamada-Vogel algorithm will always lead to a 
diagram of height zero, i.e., a closed braid, as long as it is always possible to perform Step 
3. Therefore the following lemma, whose proof was suggested to us by Bryant Adams, will 
conclude the proof of Alexander's Theorem. 

Lemma 2.2 |136j Let D be a knot or link diagram. If h(D) > 0, then the Seifert picture 
S associated to D contains a defect region. 

Proof: Each component 1Z of S 2 \S is a surface of genus with k > 1 boundary components. 
Each boundary component of 1Z is a union of some number of signed arcs (possibly zero) 
and subarcs of Seifert circles. We call the collection of Seifert circles in S which form part 
or all of a boundary component of 1Z the exposed circles of 1Z. 

Let us now examine the possible ways in which 1Z could fail to be a defect region. 
Certainly, if 1Z has one exposed circle, then it is not a defect region. If TZ has two exposed 
circles, then TZ is either a disk or an annulus. In this case, if TZ is a disk, then its two 
exposed circles are joined by at least one signed arc; hence the two circles are coherent and 
TZ is not a defect region. If TZ is an annulus, either its two exposed circles are incoherent, 
in which case it is a defect region, or else they are coherent and TZ is not a defect region. 
If TZ has three or more exposed circles, then there is necessarily one incoherent pair among 
them, and TZ is a defect region. 

Suppose that no component of S 2 \ S is a defect region and hence that each component 
is of one of the three types of non-defect regions described above. It is clear that we must 
have at least one region of the second type, since otherwise h(D) is clearly zero. We can 
think of such a region as lying between two nested, coherent circles joined by at least one 
signed arc. 

Let us now start with such a region, and try to build a diagram with no defect regions. 
We cannot add any circles in the annulus cobounded by the two nested circles, since this 
necessarily gives rise to at least one component with three or more exposed circles. In fact, 
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our only option is to add coherent circles which nest with the original two circles (and as 
many signed arcs between adjacent pairs as we like). However, such a diagram has height 
zero. Therefore, h{D) > implies that a defect region exists. This completes the proof of 
Lemma 12.11 and so also of Theorem [21 | 

The braid index of a knot or link K is the minimum number n such that there exists 
a braid X £ B n whose closure b(X) represents K. (We note that it is also common to 
refer to the index of a braid or a closed braid, meaning simply the number of its strands or 
the number of times it travels around its axis, respectively.) It is clear that the minimum 
number of Seifert circles in any diagram of a knot or link K is bounded above by the braid 
index of K. It is equally clear from the Yamada-Vogel algorithm that the reverse inequality 
holds. Thus we obtain the following corollary, which is due to Yamada |136| . It seems 
remarkable that it was not noticed long before 1987. 



Corollary 2.1 136 The minimum number of Seifert circles in any diagram of a knot or 
link K is equal to the braid index of K . 

It also follows that we have a measure of the complexity of the process of transforming a 
knot into closed braid form as follows. 

Corollary 2.2 ( 129 , 135 ) Let N denote the length of any sequence of reducing moves 
required to transform a diagram D into closed braid form. Then we have: 

iv =ftW < (n - 1) 2 ( "- 2) . m 

where n is the number of Seifert circles associated to D. 

Open Problem 1 It is an open problem to determine, among all regular diagrams for a 
given knot or link, the minimum number of Seifert circles that are needed. By Corollary 
Othis is the same as the minimum braid index, among all closed braid representatives of 
a given knot or link. We know of only one general result relating to this problem, namely 
the Morton- Franks- Williams inequality of |lllj and [21]. It will be discussed briefly in i J4.3l 
The literature also contains an assorted collection of ad-hoc techniques for determining the 
braid index of individual knots. For example, see the methods used in [HJ to prove that 
the 6-braid template in Figure l2"Ul below actually has braid index 6, which rests on the fact 
that non-trivial braid-preserving flypes always have braid index at least 3. £ 



2.3 Markov's Theorem 

To introduce the main goal of this section, we begin by recalling for the reader Reidemeister's 
theorem, which dates from the earliest days of knot theory. It was assumed and used (as a 
folk theorem) long before anybody wrote down a formal statement and proof. 

Theorem 3 (Reidemeister's Theorem) Let D,D' be any two (in general not closed 
braid) diagrams of the same knot or link K. Then there exists a sequence of diagrams 
D = D\ — > L>2 Dk = D' such that any -Dj+i in the sequence is obtained from Di 

by one of the three Reidemeister moves, depicted in Figure 
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Figure 6: The 3 Reidemeister moves. The 1, 2 or 3 strands in the left sketch of each have 
arbitrary orientations, also we give only one of the possible choices for the signs of the 
crossings, for each move. 



Proof: We refer the reader to for a complete proof. 

Alexander's Theorem, proved in the last section, guarantees us that closed braid repre- 
sentatives of a knot exist, but as previously noted, they are certainly not unique. Markov's 
Theorem, first stated in [104) with a sketch of a proof, gives us a certain amount of control 
over different closed braid representatives of the same knot. It asserts that any two are re- 
lated by a finite sequence of elementary moves and serves as the analogue for closed braids 
of the Reidemeister Theorem for knots. 

One of the moves of the Markov Theorem is braid isotopy. From the point of view 
of a topologist, braid isotopy means isotopy of the closed braid, through braids, in the 
complement of the braid axis. Morton has proved that if two braids have closures that 
are braid isotopic, then they are conjugate in B n e 108 . The other two moves that we 
need are mutually inverse, and are illustrated in Figure [7| as a move on certain (w + 2)- 
braids. We call them destabilization and stabilization , where the former decreases braid 
index by one and the latter increases it by one. The weight w that is attached to one of 
the braid strands in Figure [7] denotes that many 'parallel' strands, where parallel means in 
the framing defined by the given projection. The braid inside the box which is labelled P is 
an arbitrary (w + l)-braid. Later, it will be necessary to distinguish between positive and 
negative destabilizations, so we illustrate both now. 



P 

w /+ 



o 
A 



destabilize ■ 



stabilize 



o 
A 




destabilize 



stabilize 



o 
A 



Figure 7: The destabilization and stabilization moves. 



Theorem 4 (Markov's Theorem) Let X, X' be closed braid representatives of the same 
oriented link type K in oriented 3-space. Then there exists a sequence of closed braid 
representatives of K: 

X = X\ — > X2 — * • • • — * X r = x 1 
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taking such that each Xj+i is obtained from Xi by either (i) braid isotopy, or (ii) a single 
stabilization or destabilization. 

We call the moves of Theorem 0] Markov moves , and say that closed braids that are 
related by a sequence of Markov moves are Markov-equivalent. 

Forty years after Markov's theorem was announced, the first detailed proof was published 
in ^Sj. At least 5 essentially different proofs exist today. See for example jllUj . in which 
Morton gives his beautiful threading construction for knots and braids which also yields 
an alternate proof of Alexander's Theorem. Here we shall present a proof due to Pawel 
Traczyk 129 . It begins with Reidemeister's theorem, and uses the circle of ideas that were 
described in the previous section, and so it is particularly appropriate for us. 

Proof: We are given closed braids X,X' which represent the same oriented knot type 
K. Without loss of generality we may assume that X and X' are defined by closed braid 
diagrams Y,Y' of height h(Y) = h(Y') = 0. By Theorem El we know there is a sequence 
of knot diagrams Y = Y\ —* Y2 — > • • • — * Y^ = Y' , where in general h(Yi) > for i = 
2, . . . , k— 1, such that any two diagrams in the sequence are related by a single Reidemeister 
move of type I, II or III. The first step in Traczyk's proof is to reduce the proof to sequences 
of knot diagrams which are related by Yamada-Vogel reducing moves: 

Lemma 2.3 It suffices to prove Theorem^for closed braid diagrams Y, Y' which are related 
by sequences Y = Y\ — » Y2 — * ■ ■ ■ — > Y q = Y' with the properties (i) h(Y) = h(Y') = 0, (ii) 
h(Yi) > for i = 2, ... ,q — 1, and (iii) Yi + \ is obtained from Yi by a single Yamada-Vogel 
reducing move or the inverse of a reducing move. 

Proof: We may always assume that the diagrams Y2, . . . , Y q -\ have height > 0, for if not 
we simply replace the given sequences by the subsequences joining any two intermediate 
diagrams of height zero. 

We say that a Reidemeister move is braid-like if the strands that are involved in it are 
locally oriented in a coherent fashion, as they would be if the diagram is a closed braid. In 
particular, any Reidemeister move of type I is braid-like. To begin the proof of the lemma, 
we establish a somewhat weaker result: we claim that we can get from Y to Y' via a finite 
sequence of the following four types of moves and their inverses: 

• a braid-like Reidemeister move of type I, denoted type l b , 

• a braid-like Reidemeister move of type II, denoted type II b , 

• a braid-like Reidemeister move of type III, denoted type III 6 , 

• a Yamada-Vogel reducing move, denoted type y. 

To prove the claim, it is enough to show that non-braid-like Reidemeister moves, which we 
denote by the symbols I nb , II nb and III nb , can be achieved via a finite sequence of moves 
of type I fe , II b , III b and y. To prove this, we examine the cases I nb , lll nb and II n6 in that 
order: 

1. As previously noted, any type I Reidemeister move is of type I 6 . 

2. A type III rab Reidemeister move involves three arcs of the knot or braid. There are 
many different cases, depending on the local orientations and the signs of the 3 cross- 
ings, but they are all similar. One of the possible cases is given by the first and last 
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sketches of Figure |S1 where an arc passing under a crossing formed by the other two 
arcs is locally oriented opposite to the other two strands. The replacement sequence 
that is given in Figure |S1 shows that our type IIP Reidemeister move can be achieved 
by a sequence consisting of a type ll nb move, an isotopy, a type III 6 move and finally 
another type ll nb move. We leave the other type III" 6 cases to the reader, and we 
have reduced to the case of moves of type II n6 . 




Figure 8: Replacing moves of type III 

3. A move of type II™ 6 may be regarded as a move of type y^ if the arcs that are involved 
belong to distinct Seifert circles, so we only need to handle the case where they are 
subarcs of the same Seifert circle. This is done in Figure |5J where it is shown that the 
move can be replaced by two moves of type l b (which create two new Seifert circles) 
followed by a move of type y and another of type y~ l . This proves the claim. 






Figure 9: Replacing moves of type II 
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We are thus reduced to the case in which each diagram in the sequence taking Y to Y' 
is either type I 6 , II 6 , III 6 or y^. To complete the proof of Lemma 12.31 let t be a braid-like 
Reidemeister move to be performed on diagram Yi. Suppose that hi = h(Y{) > 0. Then we 
can find a sequence of reducing moves r±, . . . ,7-^ such that r n i o ■ ■ ■ o r\(Y,j) is a braid and 
such that the associated reducing arcs a%, . . . , a^. are each disjoint from the region in which 
t is to be performed. Thus each reducing move r,- commutes with t, and we can replace t 
with its 'conjugate' r^ 1 o • • • o r^ 1 o t o r ni o ■ ■ ■ o r\, so that t is now performed at height 0, 
i.e., on a braid. 

If t is of type II 6 or III 6 , then we are done, since a braid-like move of type II or type III 
performed on a braid is a braid isotopy. If t is of type I 6 , it is a stabilization (up to isotopy) 
only if it is performed on the braid strand 'nearest' to the braid axis. However, it is not hard 
to see how to realize a type I 6 move on an arbitrary strand in terms of Markov moves: simply 
push the strand under the others via type II 6 moves, and perform the required stabilization 
in a neighborhood of the braid axis. Note that to pass the resulting 'kink' back under a 
neighboring strand in a braid requires first a type III 6 move followed by a type 3^ _1 move 
(the 'kink' is always its own Seifert circles, so two distinct circles are necessarily involved). 
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Thus we can return the strand with the 'kink' in it back to its original position by repeated 
applications of this two-step process. Since we have just seen that any type II 6 or type lll b 
move can be realized by a finite sequence of type y^ moves and braid isotopies, this means 
that a type I b move can also be replaced by a finite sequence of type 3^ moves and braid 
isotopies. We can handle inverse moves of type I b in a similar fashion 

We have thus replaced our original sequence relating Y to Y' by a new one which 
is in general much longer, but which consists entirely of Markov moves (performed, by 
definition, on braid diagrams, i.e., on diagrams of height zero) and moves of type y ± . To 
be precise, our original sequence from Y to Y' may be replaced by a sequence of the form 
Y = Yq, . . . , Y ai , . . . , Y a2 , . . . , Y an = Y' , where h(Y ai ) = for all i and in each subsequence 
Y ai , . . . , Y ai+1 , either (1) each diagram in the subsequence has height zero and adjacent 
diagrams are related by a single Markov move, or (2) all the intermediate diagrams have 
strictly positive height and adjacent diagrams are related by a single move of type y or 
y^ 1 . Therefore in order to prove Markov's theorem, it suffices to consider only sequences 
of the second type, and the proof of Lemma l2~31 is complete. | 

Remark 2.1 The astute reader will have noticed the following: we have eliminated Reide- 
meister moves completely (they will not appear in the arguments that follow), nevertheless 
they played an important role already. We started with a sequence relating the given braids 
X and X' that consisted entirely of Reidemeister moves. We replaced it with a sequence of 
reducing moves and braid-like Reidemeister moves. The latter are in general not applied to 
diagrams of height zero, but we changed them to apply to diagrams of height zero. That is 
the moment when Traczyk's braid-like Reidemeister moves were changed to Markov moves. 
The modified sequence from X to X' has changed to a series of subsequences, each of which 
starts with a closed braid and ends with a closed braid, after which the ending braid is 
modified (by Markov moves) to a new closed braid, which is the initial closed braid in the 
next subsequence. As will be seen, Markov moves will not be used explicitly again until the 
proof of Lemma 12.81 where they are used (without the help of Reidemeister's Theorem) to 
relate very special closed braid diagrams. 

Consider now a sequence of diagrams Y = Y\ , . . . , Y n = Y' satisfying the criteria of 
Lemma 12.31 We note that, as in the above proof, we shall not in general distinguish 
between a diagram and its associated Seifert picture. Thus we shall make reference to 'a 
Seifert circle in the diagram Y{\ for example, meaning a Seifert circle in the Seifert picture 
associated to Y{. In fact, we can think of each circle in a Seifert picture as forming part of 
the associated diagram, except in a small neighborhood of signed arcs, which correspond to 
crossings. Since reducing arcs avoid signed arcs, there is no ambiguity when referring to 'a 
reducing arc in a diagram'. 

We now wish to consider the graph of the height function on our sequence. The graph 
will begin and end at height zero; each 'step' in between will either take us up 1 or down 1 
since we have reduced to the case where all moves are reducing moves (or their inverses). We 
will examine local maxima in the height function. Let Y(r),Y ,Y(s) be three consecutive 
diagrams in our sequence such that the height function has a local maximum at Y. In 
other words, we have two reducing moves r, s with corresponding arcs a r , a s in Y such that 
reducing Y along a r (resp. a s ) results in the diagram Y(r) (resp. Y(s)), and it makes sense 
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discuss Q r Ua s . We will call such a triple {Y(r), Y, Y(s)} a peak in the height function of our 
sequence. We define the height of the peak to be h(Y) and define the height of the sequence 
to be the maximum value attained by the height function on the sequence, in other words, 
the maximum over the height of all the peaks in the sequence. In order to prove Theorem^! 
we are going to induct on the height of the sequence. 

Lemma 2.4 We may assume that the reducing arcs involved in any peak in the height 
function of our sequence are disjoint. Further, the adjustments in our sequence of reducing 
moves which are required preserve the height of the sequence. 

Proof: Let {Y(r),Y, Y(s)} be a peak in the height function with associated reducing arcs 
a r and a s . We may always assume that the arcs intersect transversally and minimally. 
Suppose that |a r na s | = n > 2. By smoothing out one or more of the points of intersection, 
we find a new reducing arc a r > with the same endpoints as a r such that a r Pia r ' = and |a r 'H 
a s \ < n. We can then replace the given peak {Y(r),Y ,Y(s)} with two consecutive peaks 
{Y(r),Y, Y r i) and (Y r /,Y , Y(s)}. We call this procedure inserting the reducing operation r' 
at Y, and it essentially amounts to replacing one peak with two peaks of the same height. 
In this way, we continue on until the intersection numbers of all adjacent pairs is at most 1. 

Now suppose the arcs a r , a s associated to a given peak {Y(r),Y, Y(s)} have intersection 
number 1. If there exists a reducing arc at such that at(~)a r = at Pi a s = 0, then we can 
insert the reducing operation t at Y to produce two peaks, each with a disjoint pair of 
associated reducing arcs. Suppose that the defect region which supports a r and a s contains 
no third reducing arc which is disjoint from both a r and a s . There is only one possible 
arrangement for such a defect region, shown in Figure ITU1 




Figure 10: The case where two reducing arcs at a peak intersect once. 

With this arrangement, the reducing arcs a r and a s must act on four distinct Seifert 
circles, possibly joined by signed arcs. We will now examine the region labelled R which lies 
'outside' of the circles involved in our defect region and the signed arcs which join them. If 
R contains a Seifert circle, then it is easy to check that there must be a region somewhere in 
the diagram with three exposed circle. Such a region, as observed in the previous section, is 
necessarily a defect region which would contain a reducing arc disjoint from both a r and a s . 
If, on the other hand, R contains no Seifert circles, then it contains no signed arcs either, 
since all possible signed arcs between the four exposed circles of region R already appear 
in Figure E3 Thus we can join either pair of diagonally opposed circles by a reducing arc 
in R. 



22 



We conclude that if a peak {Y(r), Y, Y(s)} has associated arcs of intersection 1, we can 
always find a third reducing arc at so that we can insert the reducing operation t at Y, 
thereby replacing the original peak with two peaks, each having a disjoint pair of reducing 
arcs. Since the operation of inserting a reducing operation at a peak preserves the height 
of the sequence, this finishes the proof of the lemma. | 

Thanks to the previous lemma, we can assume from now on that each peak in the graph 
of our height function corresponds to a disjoint pair of reducing arcs. Before we can state 
the next lemma, which concerns the peaks in the height function, we need to introduce a 
few some concepts. Note that when the reducing arcs involved in a peak {Y(r), Y, Y(s)} 
are disjoint, the reducing moves commute, that is, they can be performed in either order, 
starting with the diagram Y and resulting in the same diagram Y' . Further, as long as the 
reducing arcs a r ,a s act on 3 or 4 distinct Seifert circles, we can perform the two reducing 
moves in either order with the same result. Observe that, since the reducing arcs a r ,a s 
are disjoint in the diagram Y, it makes sense to talk about the arc a s (resp. a r ) in the 
context of the diagram Y(r) (resp. Y(s)) obtained by reducing Y along a r (resp. a s ). In 
this case we say we have a 'commuting pair' of reducing moves associated to the peak. If a 
peak {Y(r),Y ,Y(s)} has a commuting pair, then we may replace it by a 'valley', that is, 
a subsequence {Y(r), Y' , Y(s)} where h(Y') = h(Y) — 2 and Y' = Y(s o r) = Y(r o s) is 
the result of reducing Y(r) along a s , or equivalently reducing Y(s) along a r . Thus we can 
eliminate any peak corresponding to a commuting pair (such a peak necessarily has height 
at least 2). 

In the case where two reducing arcs at a peak act on the same 2 circles, then after one 
move is performed, the second Reidemeister move will no longer be a reducing move; we call 
this a 'non-commuting pair' of reducing moves. Let {Y(r),Y, Y(s)} be a peak corresponding 
to a non-commuting pair of reducing arcs, and let C\, C2 be the two Seifert circles involved. 
Suppose there is a reducing arc at such that atC\a r = at Ha s = and such that t involves a 
circle other than C\ or C2, then we can insert t at Y to replace our peak {Y(r), Y, Y(s)} with 
two new peaks with commuting pairs of reducing moves: {Y (r ), Y, Y(t)} and {Y(t),Y, Y(s)}. 
As above, we now replace each peak with a 'valley': {Y(r), Y', Y(t)} and {Y(t), Y" , Y(s)}, 
respectively, where Y' = Y{t or) = Y{r o t) is the diagram resulting from reducing Y(r) by 
t (or equivalently, from reducing Y(t) by r, and Y" is the diagram resulting from reducing 
Y(s) by t (or equivalently, from reducing Y(t) by s). Again, this implies that the height of 
the original non-commuting peak {Y(r), Y, Y(s)} was at least 2. Thus, we can replace such 
a peak with peaks of strictly smaller height and repeat this process until all peaks either 
have height 1 or do not admit such a reducing arc at as above; we call a peak of the latter 
type irreducible . 

We can now state the next lemma: 

Lemma 2.5 We may assume that any peak in the height function of our sequence either 
has height 1 or is irreducible. 

Proof: The proof is clear. In the discussion that preceded the statement of the Lemma 
we defined a peak to be irreducible in such a way that it subsumed all possibilities which 
did not allow us to reduce to height 1. | 

We note that each condition of Lemma f2 . 5l necessarilv implies the non-commuting condition. 
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Lemma 2.6 We may assume that no peaks in the height function of our sequence have 
height 1. 

Proof: Let {Y(r),Y ,Y(s)} be a peak of height 1. We recall that height 1 implies that 
the two reducing arcs are non-commutative and hence involve precisely two circles, it is 
an easy exercise to show that these two circles must live either on the 'inside' or on the 
'outside' of a band of circles, and that a r and a s are in fact equivalent as reducing moves. 
Therefore the diagram Y(r) is equivalent to Y(s) and we can simply eliminate this peak 
from our sequence. | 

It remains to deal with irreducible peaks in the height function of our sequence. Fortu- 
nately, it turns out that they only occur in a very particular way. To describe the particular 
way, define a weighted Seifert circle in a manner which is similar to the weights that we 
attach to closed braid diagrams, e.g. as in Figure [7J That is, A Seifert circle with weight 
w attached means a collection of w coherently oriented, nested, parallel Seifert circles. We 
use the term band for a Seifert circle with an attached weight. 

Lemma 2.7 If {Y(r),Y ,Y(s)} is an irreducible peak in the height function of our sequence, 
then the diagram Y contains at most four bands, arranged as in Figure 

Proof: Let {Y(r),Y ,Y(s)} be an irreducible peak, and let C\,C2 be the two circles 
involved in the reducing arcs a r and a s . See Figure UTT i). For i = 1,2, let Di denote 




Figure 11: (i) The diagram corresponding to to the irreducible peak {Y(r),Y ,Y(s)}. (ii) 
Two additional reducing arcs used to eliminate irreducible peaks. Each pair a r ,a s and 
possible non-commuting pair associated to the peak. The blocks X\, X2, X3, X4 
indicate (possibly empty) collections of signed arcs. 

the disk bounded by Cj in S 2 which does not contain the reducing arcs a r and a s . Then 
S 2 \ {D\ U D2 U a r U a s ) has two components. By assumption, neither component can 
contain a defect region (or else we could find a reducing arc a t as above, contradicting the 
irreducibility of our peak). Thus if either component contains any Seifert circles, the circles 
must form a band, oriented oppositely to C\ and C2. We allow the possibility that the 
weight of these bands is zero. The same reasoning shows that Di cannot contain a defect 
region for i = 1, 2 and hence that Cj must be the outer circle of a band. It is of course 
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possible that some braiding takes place between adjacent coherent circles, as indicated in 
Figure ITlT i). The braids joining the various bands are labelled JQ. Thus we have a diagram 
of the given form, in which each circle pictured represents a band, and the lemma is proved. || 
The following lemma will allows us to replace irreducible peaks in the height function of 

our sequence with peaks of strictly smaller height. 

Lemma 2.8 Let {Y(r),Y, Y(s)} be an irreducible peak of height n+1 in the height function 
of our sequence. Then there exist sequences of diagrams Y(r) = Y{, . . . , Y£ = Y(p o r) and 
Y(s) = Yf, . . . ,Y£ = Y{u o s) such that Y[ +l (resp. Y? +l ) is obtained from Y? (resp. Yf) 
by a reducing move and such that h(Y(p o r)) = h(Y(u o s) = and Y{p o r) and Y{u o s)) 
are Markov equivalent. 

Before discussing the proof of this lemma, we show how to use it to prove the Markov 
theorem. Using Lemmas 12.312.71 we have reduced the proof of Markov's Theorem to the 
situation of two closed braid diagrams X and X' related by a sequence of diagrams related 
by reducing moves (and their inverses) such that the height of each intermediate diagram 
is strictly positive and such that any peak in the height function of the sequence is irre- 
ducible (of height at least 2). Let {Y(r), Y, Y(s)} be an irreducible peak of height n in the 
height function of our sequence. By Lemma l2.8( we can replace this subsequence with a 
subsequence of strictly smaller height (possibly including a subsequence entirely at height 
related by Markov moves, which we can remove from consideration as before). In doing 
so, we create new peaks whose height is strictly lower than the height of the peak being 
replaced. If we perform this operation at every irreducible peak, then we obtain a new 
sequence relating our closed braid diagrams Y, Y' . 

The new peaks may or may not be irreducible; in fact, their corresponding arcs may not 
even be disjoint, but now we are back in the same situation as we were before Lemma 12.41 
except that we are starting with a sequence of lower height. Thus by induction on the height 
of the sequence (with base case provided by Lemma l2~H|) . we can replace any sequence as 
described in Lemma 12 . 31 with a sequence consisting entirely of diagrams of height zero. This 
completes the proof of Theorem 0] modulo the proof of Lemma 12.81 1| 

Sketch of the Proof of Lemma 12. 8L By Lemma 12.71 the diagram Y has at most 
four bands which we label B\, B2, -B3, and B4, joined by four (possibly trivial) braids 
Xi,X2,X$,X4, as in Figure UTT i) and (ii). Let Wi denote the weight of the band B{. 
Note that, e.g., the braid X\ has w\ + W3 strands, and similarly for the other Xj. If Wi or 
Wj is greater than 1, then a reducing arc joining Bi, Bj is understood to indicate a sequence 
of WiWj reducing moves. There are several ways to construct such a sequence; we adopt the 
convention that we choose reducing moves in such a way that the strands of one band all 
slide under or else all slide over the strands of the other band. 

Referring again to Figure ITT1 we first perform the reducing move r along the arc a r and 
then reduce again via the arc a p involving B3 and B4, giving us some number of reducing 
moves depending on the weights of the bands involved. The resulting diagram Y{p o r) is 
the closure of the first braid shown in Figure IT2*l up to a choice of arcs passing over or under 
in the various Reidemeister moves of type II. This gives us the first sequence of the lemma. 
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Figure 12: Two braids whose respective closures are the result of performing the two pairs 
of reducing moves r,p and s,u indicated in Figure ITT1 

To get the second sequence of the lemma, we begin instead with the reducing move s 
along the arc a s and then reduce again via the arc a u . The resulting diagram Y(u o s) is 
the closure of the second braid shown in Figure IT2l 

We have found our two sequences of reducing moves, and we have reduced the proof of 
Markov's Theorem to just one specific calculation, namely, showing that the closed braids 
Y(p o r) and Y(u o s) corresponding to the two braids of Figure El are M-equivalent. The 
two braids can be related by a sequence consisting of several braid isotopies as well as two 
stabilizations and two destabilizations; the reader is referred to |129j for the details of this 
calculation. This concludes the proof of Lemma 12.81 and so also of Theorem |IJ | 

Remark 2.2 Our choice of 'over' or 'under' in the reducing moves p and u leads to possible 
ambiguity, but the various diagrams which would result from different choices are all related 
by 'exchange moves', which are defined and discussed at the beginning of the next section. 
We will show in the next section that an exchange move replaces a sequence of 4 Markov 
moves: a braid isotopy, a stabilization, a second braid isotopy and a destabilization. 

The essential groundwork has been laid regarding the connection between braids and 
knots. From this point on, all knot diagrams will be assumed to be in closed braid form, 
i.e. in the form indicated in Figure E3 In the sections that follow we will examine many 
consequences. 
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3 Braid foliations 



We begin our study of new results which relate to the study of knots via closed braids by 
presenting some results that use the theory of braid foliations of a Seifert surface bounded 
by a knot which is represented as a closed braid. We will develop three applications of braid 
foliations: The first is Theorem in fcl3.ll We give an essentially complete proof of the 
'Markov theorem without stabilization' (MTWS) in the special case of the unknot, based 
upon the presentation in 21 . In ^3.21 we state the MTWS, Theorem H3 in the general case. 
A full proof of that theorem can be found in |3U| . In the final section, N3.3I we give an 
application of the MTWS to contact topology. 

3.1 The Markov Theorem Without Stabilization (special case: the un- 
knot) 

After describing the basic ideas about braid foliations, we will apply them to the study 
of a classical problem in topology, the unknot recognition problem. Alexander's Theorem 
(Theorem |2J) tells us that every link K may be represented as a closed n-braid, for some n. 
Markov's Theorem (Theorem^J tells us how any two closed braid representatives of the same 
knot or link are related. Looking for a way to simplify a given closed braid representative 
of a knot or link systematically, the first author and Menasco were lead to the study of the 
unknot as a key example. There is an obvious choice of a simplest representative, namely a 
1-braid representative. The Markov Theorem Without Stabilization for the unknot, which 
is stated below as TheoremEJ asserts that, in the special case of the unknot, the stabilization 
move of Markov's Theorem can be eliminated, at the expense of adding the exchange move. 
Therefore we begin with a discussion of the exchange move, which is defined in Figure ITU 
and the reason why it is so important. 




Figure 13: The left top and bottom sketches define the exchange move. The right sequence 
of 5 sketches shows how it replaces a sequence of Markov moves which include braid isotopy, 
a single stabilization, additional braid isotopy and a single destabilization. 
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A natural question to ask is what is accomplished by stabilization, braid isotopy and 
destabilization? Figure El shows that exchange moves are the obstruction to sliding a 




braid isotopy exchange move 



braid isotopy 



Figure 14: Exchange moves are the obstruction to sliding a trivial loop around a closed 
braid. 

trivial loop around a braid [34] . However there is more at issue than just sliding trivial 
loops around a braid. As is illustrated in Figure Hoi a sequence of exchange moves, together 
with braid isotopy, can create infinitely many closed braid representatives of a single knot 
type, all of the same braid index. This bypasses a key question: do exchange moves actually 
change conjugacy class? The answer is "yes", and in fact the phenomenon first appears in 
the study of closed 4-braid representatives of the unknot. In £0 we show that there is a 
definitive test for proving it. 




exchange move 1 



Figure 15: A sequence of exchange moves and braid isotopies with non-trivial consequences. 
Keeping Figures H^l 1141 and ll5l in mind, we are ready to state the MTWS for the unknot: 



Theorem 5 29 Every closed braid representative K of the unknot U may be reduced to the 
standard 1-braid representative U\, by a finite sequence of braid isotopies, destabilizations 
and exchange moves. Moreover there is a complexity function associated to closed braid 
representative in the sequence, such that each destabilization and exchange move is strictly 
complexity-reducing. 
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The first proof of Theorem |S] was the one in [25] . A somewhat different and slicker 
proof can be found in |21j . but it requires more machinery than was necessary for present 
purposes. We follow the proof in j^H]. However, parts of our presentation and most of 
our figures were essentially lifted (with the permission of both authors) from j^J, a review 
article on braid foliation techniques. Our initial goal is to set up the machinery needed for 
the proof. 

Recalling the definition of a closed braid from ^2.11 we adopt some additional structure 
and say that K is in closed n-braid form if there is an unknot A in S 3 \ K, and a choice of 
fibration H of the solid torus S 3 \ A by meridian disks, such that K intersects each fiber of 
H transversely. Sometimes it is convenient to replace S 3 by i? 3 and to think of the fibration 
H as being by half-planes {Hg;9 £ [0, 2-7r]} of constant polar angle 9 through the z-axis. 
Note that K intersects each fiber Hg in the same number of points, that number being the 
index n of the closed braid K. We may always assume that K and A can be oriented so 
that K travels around A in the positive direction, using the right hand rule. Now let K be 
a closed braid representative of the unknot, and let T> denote a disk spanned by K, oriented 
so that the positive normal bundle to each component has the orientation induced by that 
on K = dT>. In this section we will describe a set of ideas which shows that there is a 
very simple method that changes the pair (K,T>) to a planar circle that bounds a planar 
disc, via closed braids, moreover there is an associated complexity function that is strictly 
reducing. The ideas that we describe come from |29j . however our main reference will be to 
the review article |21| . 

The braid axis A and the fibers of H will serve as a coordinate system in 3-space in 
which to study T>. A singular foliation of T> is induced by its intersection with fibers of H. 
A singular leaf in the foliation is one which contains a point of tangency with a fiber of H. 
All other leaves are non-singular. 

It follows from standard general position arguments that the disk T> can be chosen to 
be 'nice' with respect to our fibration. More precisely, we can assume the following: 

(i) The intersections of A and T> are finite in number and transverse. 

(ii) There is a neighborhood Na of A in R s \ K such that each component of T> n Na is 
a disk, and each disk is radially foliated by its arcs of intersection with fibers of H. 
There is also a neighborhood Nk of K in Ft? such that Nk n T> is foliated by arcs of 
intersection with fibers of H which are transverse to K. 

(iii) All but finitely many fibers Hg of H meet T> transversely, and those which do not 
(the singular fibers) are each tangent to T> at exactly one point in the interior of 
both T> and Hg. Moreover, each point of tangency is a saddle point (with respect to 
the parameter 0). Finally, each singular fiber contains exactly one singularity of the 
foliation, each of which is a saddle point. 

It can also be assumed (see f° r details) that 

(iv) Each non-singular leaf is either an a- arc , which has one endpoint on A and one on 
K = dT> or a b- arc , which has both endpoints on A. 
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(vii) Each 6-arc in a fiber Hg separates that fiber into two components. Call the 6-arc 
essential if each of these components is pierced at least once by K, and inessential 
otherwise. Then we also have that all 6-arcs in the foliation of T> may be assumed to 
be essential. 

Note that T> cannot be foliated entirely by 6-arcs, since such a surface is necessarily 
a 2-sphere. Thus, if the foliation of T> contains no singularities, T> is foliated entirely by 
a- arcs. Otherwise, let U be the union of all the singular leaves in the foliation of T>. Moving 
forward through the fibration, we see that any singular leaf in the foliation is formed by 
non-singular leaves moving together to touch at a saddle singularity. The three types of 
singular leaves which can occur are labelled aa, ab or 66, corresponding to the non-singular 
leaves associated to them. Now each singular leaf A in U has a foliated neighborhood N\ 
in D such that N\ PiU = A. According to whether A has type aa, ab, or 66, N\ is one of 
the foliated open 2-cells shown in Figure fT^T i^ . with the arrows indicating the direction of 
increasing 9. 
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Figure 16: (i) The three types of tiles in the decomposition of T>; (ii)The canonical embed- 
ding of each type of tile. 

The complement of U in T> is a union B\ U B2 U ... U B^, where each B{ is foliated entirely 
by a-arcs or entirely by 6-arcs. Choose one non-singular leaf in each Bi and declare it to be 
a boundary arc of type a or type 6 according to whether it is an a-arc or 6-arc, respectively. 
Then the union of all boundary arcs determines a tiling of T>, that is, a decomposition into 
regions called tiles , each of which is a foliated neighborhood of one singular leaf. Each tile 
has type aa, ab and 66, according to the type of its unique singularity. Note that a tiling 
of P is a foliated cell-decomposition. We further define the sign of a singular point s in the 
foliation of T> to be positive if the positive normal to T> points in the direction of increasing 
8 in the fibration, negative otherwise. The sign of a tile is then defined to be the sign of its 
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singularity. 

The axis A intersects the surface in a finite number of points, called vertices of the tiles. 
Each vertex v is an endpoint of finitely many boundary arcs in the surface decomposition. 
Let the type of v be the cyclic sequence {x\, ...,x r ), where each Xj is either a or b, and the 
sequence lists the types of boundary arcs meeting at v in the cyclic order in which they 
occur in the fibration. See Figure IT71 The valence of a vertex v is the number of distinct 




type (b,b) type (a,b) 

Figure 17: A vertex v of type (a), (ab) and (bb). 

tiles intersecting at v. The sign of a vertex v is the cyclic array of signs of the tiles meeting 
at v. See Figure IT7I 

The parity of v is said to be positive (resp. negative) if the outward-drawn normal to 
the surface has the same (resp. opposite) orientation as the braid axis at the vertex. Thus 
when we view the positive side of the surface, the sense of increasing 9 around a vertex 
will be counterclockwise (resp. clockwise) when the vertex is positive (resp. negative), as 
illustrated in Figure ITnT i) . 

Finally, we note that (again, see |2J for a proof) that, up to a choice of their sign, tiles 
of type aa, ab, bb each have a canonical embedding in 3-space, which is determined up to an 
isotopy of 3-space which preserves the axis A and each fiber of H setwise. The canonical 
embedding for each tile is shown in Figure I16f ii) . 

The decompositions of the disk T> which we have just described are not unique. We 
shall now describe three ways in which they can be changed. In each of the three cases the 
possibility of making the change is indicated by examining the combinatorics of the tiling. 
The change is realized by an isotopy of the disk which is supported in a neighborhood N 
of a specified small number of tiles of type aa, ab or bb, leaving the decomposition of T> 
unchanged outside N. 

For a spanning disk T> and a tiling T, we denote the complexity of the tiling by c(T>, T), 
where we define c(T>, T) = (n, S), where n is the braid index of the boundary and S is the 
number of singularities in the tiling. Setting and V- equal to the number of positive and 
negative vertices, one has n = V+ — V-, so n is also determined by the tiling. Tiled discs 
(V,T) can then be ordered, using lexicographical ordering of the associated pair c(V,T). 
Each of our three moves will replace the given disk and tiling T),T with some T>',T', with 
the following effects on complexity: 

• Change in foliation: (n',5") = (n, S). 
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• Destabilization: (n 1 , S') = (n — 1, S — 1). 



• Exchange moves (two types): (n',S') = (n,S — 2). 

We focus here only on the combinatorics of tilings which admit one of the above moves, 
as well as the effect of each move on the embedding of T> and the new tiling of T> which 
results. For proof and further details, see |21j . 

•Changes in foliation. The choice of a foliation of T> is not unique, and our first move 
involves ways in which the surface decomposition can be changed by an isotopy of T> or, 
equivalently, by an isotopy of the fibers of H keeping T> fixed. This particular change was 
introduced in |28j for 2-spheres and was modified in j^H] for certain spanning surfaces. In 
what follows, we say that two tiles are adjacent if they have a common boundary arc in the 
given tiling. 



(i) 



(ii) 




Figure 18: Sketch (i) shows an isotopy of T> that induces a change in foliation. Sketch (ii) 
shows the effect of the change on two a6-tiles 

A tiling admits a change in foliation whenever there are two tiles Ti, T<i of the same 
sign adjacent at a fo-arc. Roughly speaking, a change in foliation is a local isotopy of the 
surface which pushes two saddle points past each other. Locally, the disk V is embedded as 
in the left sketch in Figure fTHT i^ . A change in foliation is defined as the passage from the 
embedding left to the right embedding. Figure llBf ii) the effect of this move on the local 
foliation, while Figure IT8l ni) illustrates the effect of a change in foliation on the tiling, in 
the case of two adjacent (arj)-tiles. 

•Destabilization via a type (a) vertex. A vertex v of valence 1 in a tiling of T> occurs 
when two of the edges in a single tile T are identified in T>. Such a vertex must have type 
(a), and therefore T is an aa-tile. See Figure IT71 Since there is a canonical embedding for 
an aa-tile in 3-space (Figure I16|) , identifying two edges of an aa-tile with endpoints on a 
common vertex yields the canonical embedding for T shown in Figure HUT i). Notice that 
there is a radially foliated disk D in T cut off from T> by the arc of the singular leaf of T 
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with both endpoints on K. Hence there must be a trivial loop in the braid representation 
of K, and we can modify the braid and the disk T> in the manner illustrated. We call this 
modification destabilization via a type (a) vertex. The effect on the tiling of T> is that the 
aa-tile T and its type (a) vertex v are deleted, while the tiling outside T is unaltered. The 
braid index is decreased by one. See the right sketch in Figure ITW i), 



AAA 




Figure 19: The effect of (i) a destabilization, and (ii) an exchange move of type (bb), on 
the embedded (and foliated) foliated disc V and on the braid K that is its boundary. 

•Exchange moves. The destabilization move that we just described can be accomplished 
if a vertex of valence 1 exists in our tiling. Exchange moves are based upon the existence 
of a vertex of valence 2. An example of the type of move we would like to achieve on the 
braid itself was shown in sketch (ii) of Figure llijf ii) , and also in in the second row from the 
top in Figure 1201 There are two different types of tiling patterns, and hence two different 
types of local embeddings of T>, which support exchange moves: 

Exchange Move Type (bb): We now consider a vertex v with sign (±,=p) and type 
(b,b). In this situation, two fcfe-tiles are adjacent along consecutive 6-arcs. The canonical 
embeddings of the tiles in this case are shown in Figure ITH1 and the overall modifications to 
V are suggested by Figure HUT ri^. The end result is a new surface V whose decomposition 
has at least two fewer vertices (in particular, the vertex v is deleted) and at least two fewer 
regions than the decomposition of P. | 

Exchange Move, Type (ab) An exchange move of type (ab) is possible whenever the 
tiling of T> has a vertex v of valence 2, type (a,b), and sign (±, =p). Such a vertex occurs 
only when two ab-tW.es are adjacent along corresponding a and b edges, as in Figure IT7I The 
isotopy of dT> = K is achieved by pushing a subarc a of K across a disk which is contained 
in the union of two (ab) tiles Ti,T2, to a new arc a'. The effect on the tiling is that the 
a6-tiles T\ and are deleted and any adjacent affiles (resp. 66-tiles) become aa-tiles (resp. 
a6-tiles). 

In order to prove the MTWS for the unknot, we must locate the places on the tiling 
of T> where the three moves just described can be made. We begin with two lemmas. The 
first of the two is implicit in [Hj, but we refer the reader to [2^ f° r an explicit proof. 
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Lemma 3.1 [S] Let v be a vertex of type (b,b, . . . ,b). Then the set of all singularities s 
which lie on a singular leaf ending at v contains both positive and negative singular points. 



Lemma 3.2 _29j Suppose that the disk T> is nontrivially tiled and has no vertices of valence 
1. Then the tiling ofT> contains a vertex of type (ab), (bb), or (bbb). 

Proof. The proof is essentially an Euler characteristic argument. A tiling T of the disk 
T> corresponds to a cell decomposition of the 2-sphere in the following way. Let V be the 
number of vertices in T, let E be the number of boundary arcs in T, which we will think 
of as edges, and let F be the number of tiles in T. Now let £ be the 2-sphere obtained 
by collapsing dT> = K to a point. This gives a cell decomposition of £ with V + 1 0-cells 
(vertices), E 1-cells (edges), and F 2-cells (faces). We know that the Euler characteristic 
x(S) = 2, and therefore we have that V — E + F = 1. We also know that every face in 
our cell decomposition of £ has 4 edges, and that every edge has 2 adjacent faces, and so 
E = 2F. We therefore have the following equation. 

2V - E = 2. (10) 

Now let V(a, (3) denote the number of vertices in T with a adjacent a-arcs and (3 adjacent 
6-arcs. If v is the valence of such a vertex, then v = a + (3, and so V(a, v — a) denotes the 
number of vertices in T with valence v and a adjacent a-arcs. Since by hypothesis v ^ 1, 
we can now count the number of vertices in T using the following summation: 



v=2 a=0 



Now write E = E a + Ei,, where E a denotes the number of edges in T of type a, and similarly 
for Ef). Since an a-edge is incident at one vertex in T, and a 6-edge is incident at two vertices 
in T, we can write: 

OO V 

v - a) (11) 

v=2 o=0 

^ OO V 

v =2 a=0 

Substituting into Equation 1101 we now have 

OO V 

y~^y^(4- v — a)V(a, v - a) = 4. 



v=2 a=0 



Observe now that if v > 4, the coefficient (4 — v — a) < 0. Writing out the terms 
corresponding to v = 2 and v = 3 gives, respectively, 2V(0, 2) + V(l, 1) and V(0, 3) — 
V{2, 1) — 2V(3,0). Each V(a,(3) > by definition. Hence we can move terms around in 
the above equation so that each term is nonnegative, as follows: 

OO V 

21/(0, 2) + V(l, 1) + V(0, 3) = 4 + V{2, 1) + 2V{3, 0) + ^(4 -v- a)V{a, v - a). 

u=4 a=0 
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The right-hand side of the equation is clearly > 4, and therefore at least one term on the 
left-hand side is nonzero. Since V(0, 2), V(l, 1), and V(0, 3) record the number of type (bb), 
type (ab), and type (bbb) vertices, respectively, the lemma is proved. | 

We are now ready to prove Theorem |SJ the MTWS in the case of the unknot. 

Proof of Theorem 4. We start with an arbitrary closed braid representative K of the 
unknot. Our closed braid K is the boundary of a disk T> which admits a tiling T. Note that 
if the complexity of the tiling c(V, T) is equal to (1,0), then our disk is radially foliated by Cl- 
ares and K is the standard embedding of the unknot. Therefore we assume c(T>, T) > (1,0). 
We will use induction on c(V, T) to show that after a finite sequence of our three moves, V 
is radially foliated by a-arcs. 

We first observe that if there exists a vertex of type (a) in T, then we can destabilize 
along this vertex, thereby reducing c(T>,T). Hence we can asume that all vertices in T 
have valence at least 2. By Lemma 13.21 T> must have a vertex of type (ab), (bb), or (bbb). If 
there is a type (bbb) vertex, then at least two of the adjacent tiles must have the same sign. 
In this case we can do a change in foliation, replacing our (bbb) vertex with a (bb) vertex. 
Therefore we may assume that T> contains a vertex of type (ab) or type (bb). 

If D has a type (ab) vertex with sign (+,+) or (-,-), then we can apply a change in 
foliation which replaces this vertex with a type (a) vertex. We now destabilize along this 
type (a) vertex in order to reduce the complexity. If T> has a type (ab) vertex with sign (+, 
-), then we can do an exchange move of type (ab), thereby reducing complexity. If T> has a 
type (bb) vertex, then it is an interior vertex, and by Lemma 13.11 it has sign (+,-). Then 
we can do an exchange move of type (bb), which reduces complexity. 

Thus if c(V,T) > (1,0), we can always reduce until c(V,T) = (1,0), and the theorem 
is proved. | 

Example 3.1 [Morton's irreducible 4-braid] We now give an example illustrating the 
necessity of exchange moves. In 109 '., Morton gave the following example of an irreducible 
braid: 

X = 1T3 2 0"2<7 3 1 0"20"f<72 0"10" 2 ) 

i.e., a braid whose closure represents the unknot but cannot be isotoped to the unknot in the 
complement of the braid axis. Now, it is clear that if a braid in B n admits a factorization 
of the form wia n -iW2(T~\, where w\ and W2 are words in a\, . . . , <7 n _ 2 , then its closure 
admits an exchange move. As observed in [55], the conjugate braid (a^a2T\)X (crjj -1 a^ 1 c^ 1 ) 
is isotopic to the braid 

X ' = ( (J 2 2a l (T 2 1 ) (T 3( cr 2 a l lcr 2) (J 3 ■ 

Thus X' admits an exchange move to obtain the new braid word (a^axa^ )a^ (a\a^[ 1 o"2)f"3, 
which can be cyclically rewritten as 

X" = (o- 2 o" 3 cr2 2 )f7i(cr2 fjVDcrf . 

If we now interchange the axis of our braid with the point at infinity, we can perform another 
exchange move to obtain 

X'" = (a 2 a 3 a^)a^(a 2 1 a^a 3 2 )a l 
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which can in fact be isotoped to the unknot in the complement of its braid axis |29j . 

Wright has given a foliated disk corresponding to this braid in |133j . The reader is 
cautioned that Wright uses a different notation convention for braid words than that used 
in this paper (see p. 98 of j!33| b We also refer the reader to 32 for further examples. 4 

We note that Theorem has been used as the basis for an algorithm for recognizing the 
unknot. See [22] . This algorithm has been put on a computer [32], however it needs more 
work before it can become a practical tool for recognizing the unknot. 

Open Problem 2 At this writing the development of a sound and practical algorithm for 
unknot recognition remains one of the major open problems in low dimensional topology. A 

With regard to Problem [2] we note that Theorem 03 proves the existence of a monotonic 
simplification process which begins with an arbitrary closed n-braid representative of the 
unknot and ends with a 1-braid representative, but the complexity function c(T>, T) which 
gives instructions for the process is 'hidden' in the tiling of the surface T>. We need it in 
order to know when complexity-reducing destabilizations and exchange moves are possible. 
The reader who is interested in this problem might wish to consult [22] ; where a somewhat 
different approach suggests itself. Instead of working with the tiled surface, one may work 
with the 'extended boundary word' of [22]. The latter is a closed braid which is obtained 
from an arbitrary closed braid representative of the given knot by threading in additional 
1-braids, and it seems likely that it will give an alternative monotonic reduction process. 
Thus we suggest: 

Open Problem 3 Investigate the monotonic reduction process of Theorem\^from the point 
of view of the extended boundary word of |22j . X 

3.2 The Markov Theorem Without Stabilization, general case 

In this section we state the generalized version of Theorem [5] which was established in j3U] 
for arbitrary closed braid representatives of arbitrary knots and links. The moves which 
are needed are of course much more complicated than in the case of the unknot. They are 
described in terms of 'block strand diagrams' and 'templates'. The concept of a block-strand 
diagram is fairly easy to understand from the 8 examples in Figure [2771 The first important 
feature of a block-strand diagram is that after an assignment of a braided tangle to each 
block, it becomes a closed braid that represents a specific knot or link. Thus each block 
strand diagram determines infinitely many closed braid representatives of (presumably) 
infinitely many knots and links. A template is a pair of block-strand diagrams, both of 
which represent the same knot or link, when we make the same braiding assignments to 
corresponding blocks. Note that there are two destabilization templates (they differ in the 
sign of the 'trivial loop' that is removed). There are also two flype templates, which differ 
in the sign of the single crossing which is outside all the braid boxes. Templates always 
have an associated braid index, namely the braid index of any knot or link of minimum 
braid index that they carry. Destabilization templates occur for every braid index > 2. 
Since the exchange move can be realized by braid isotopy when the braid index is < 3, it 
does not play a role until braid index > 4. The flype templates occur for braid index > 3. 
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The two destabilization templates 




The exchange move template 







The two flype templates 



jZI — — > 



J \ r 



A 6-braid template 



Figure 20: Examples of templates 
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The method by which the 6-braid template in Figure QUI was constructed is described in the 
manuscript |3T?j . 

The two block strand diagrams in a template are always related by a sequence of Markov 
moves, however the sequence may be quite complicated, and so the isotopy that takes the 
left diagram to the right diagram is in general not obvious. This fact is illustrated by the 
6-braid template of Figure HOJ (See j30] for the isotopy, as an explicit sequence of Markov 
moves.) 

With this brief introduction, we are able to state the Markov Theorem Without Stabi- 
lization in the general case: 

Theorem 6 30_ Let B be the collection of all braid isotopy classes of closed braid repre- 
sentatives of oriented knot and link types in oriented 3-space. Among these, consider the 
subcollection B(K) of representatives of a fixed link type K . Among these, let B m i n {K) 
be the subcollection of representatives whose braid index is equal to the braid index of K . 
Choose any X + G B(K) and any X— £E B m i n {JX). Then there is a complexity function 
which is associated to X + ,X- , and for each braid index m a finite set T{m) of templates is 
introduced, each template determining a move which is non-increasing on braid index, such 
that the following hold: First, there is are initial sequences which modify X- — » X'_ and 
X+^X' + : 

x_ = x l _^ ► x p = x'_, x + = x\ -> . . . -> x% = x' + 

Each passage X'i_ — > Ai +1 is strictly complexity reducing and is realized by an exchange 
move, so that = b(Xi). These moves 'unwind' X^, if it is wound up as in the 

top right sketch in Figure \15l Each passage Xi. — > is strictly complexity-reducing 

and is realized by either an exchange move or a destabilization, so that 6(Ar}_ +1 ) < b(Xi_). 
Replacing X + with X' + and X- with X'_, there is an additional sequence which modifies 
X' + , keeping X'_ fixed: 

X' + = X q -> > x r = x'_ 

Each passage X^ — ► in this sequence is also strictly complexity-reducing. It is realized 

by one of the moves defined by a template T in the finite set T(m), where m = b(X + ). 
The inequality b(X^ +1 ) < b(X^) holds for each j = q, . . . ,r — 1 and so also for each j = 
l,...,r-l. ' 

The proof of Theorem uses the braid foliation techniques that were used in the proof 
of Theorem El but in a more complicated setting. Instead of looking at a foliated embedded 
disc which is bounded by a given unknotted closed braid, we are given two closed braids, 
X + and and an isotopy that takes X + to X_. The trace of the isotopy sweeps out an 
annulus, but in general it is not embedded. The proof begins by showing that the given 
isotopy can be split into two parts, over which we have some control. An intermediate link 
Xq which represents the same link type K as X + and X_ is constructed, such that the trace 
of the isotopy from X + to Xq is an embedded annulus A+. Also the trace of the isotopy 
from Xq to A_ is a second embedded annulus A- . The union of these two embedded annuli 
TA = A+ U A- is an immersed annulus, but its self-intersection set is controlled, and is a 
finite number of clasp arcs. The main tool in the proof of Theorem is the study of the 
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braid foliation of the immersed annulus TA. In the next section, i.e. ij3.3l we will see how 
Theorem was used to settle a long-standing problem about contact structures. 

3.3 Braids and contact structures 

In this section we describe how Theorem |B] was used in [3^ to settle a problem about contact 
structures on R 3 and <S 3 . 

Let A be the z-axis in M 3 , with standard cylindrical coordinates (p,9,z) and let H be 
the collection of all half-planes Hg through A. The pair (A, H) defines the standard braid 
structure on M 3 . Using the same cylindrical coordinates, let a be the 1-form a = p 2 d6 + dz. 
The kernel £ of the 1-form a defines a contact structure on M 3 . We can visualize £ by 
imagining that there is a 2-plane (spanned by d/dx and d/dp) attached to every point 
in M 3 . Figure |^ shows both the braid structure and the polar contact structure, for 
comparison. The family of 2-planes that define £ twist (to the left) as one moves along the 
x-axis from to oo. The family is invariant under rotation of 3-space about the z-axis and 
under translation of 3-space along rays parallel to the z-axis. Its salient feature is that it is 
totally non-integrable, that is there is no surface in M 3 which is everywhere tangent to the 
2-planes of (£) in any neighborhood of any point in M 3 . (Of course this makes it hard to 
visualize). The twisting is generic in the sense that, if p is a point in a contact 3- manifold 
M 3 , then in every neighborhood of p in M 3 the contact structure is locally like the one we 
depicted in Figure l2*Tl 




standard braid structure standard tight polar contact structure 



Figure 21: The standard braid structure and tight contact structure on M 3 

Let K be a knot (for simplicity we restrict to knots here, but everything works equally 
well for links) which is parametrized by cylindrical coordinates (p(t),6(t), z(t)), where t £ 
[0, 2ir]. Then, as defined in H2.ll K is a closed braid if p{t) > and d9/dt > for all t. 
On the other hand, K is a Legendrian (resp. transversal ) knot if it is everywhere (resp. 
nowhere) tangent to the 2-planes of £. In the Legendrian case this means that on K we have 
dO/dt = (—l/p 2 )(dz/dt). In the transversal case we require that dO/dt > (-l/p 2 )(dz/dt) 
at every point of K{t). We are interested here primarily in the transversal case. 

The total twist of the contact structure is the number of multiples of tt/2 as one traverses 
the positive real axis from the origin to oo. The case when the total twist angle is tt is known 
as the standard (polar) contact structure. We call it £, w . While the braid structure is very 
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different from the standard polar contact structure near A, for large values of p the 2- 
planes in £ n are very close to the half-planes Hg of the braid structure. In fact, assume 
that X is a closed braid which represents a knot K, and that X bounds a Seifert surface of 
minimum genus which supports a foliation as in H3.ll Assume further that the closed braid 
X intersects every 2-plane in the contact structure transversally. Then, in the complement 
of a tubular neighborhood of the braid axis, from the point of view of a topologist, the braid 
foliation and the foliation induced by the contact structure will be 'the same'. 

Recall that Theorem^ (Alexander's Theorem) was first proved in 1925. Sixty years later, 
Bennequin adapted Alexander's original proof (which is different from the proof given in this 
article) to the setting of transversal knots in [Sj , where he showed that every transversal knot 
is isotopic, through transversal knots, to a closed braid. In 2002 Orevkov and Shevchishin 
extended Bennequin's ideas and proved a version of Theorem 21 (Markov's Theorem) which 
holds in the transversal setting: 

Theorem 7 |116j : Let TX + ,TX^ be closed braid representatives of the same oriented link 
type K in oriented 3-space. Then there exists a sequence of closed braid representatives of 
TK: 

TX+ = TXi -► TX 2 -> ► TX r = TX X 

such that, up to braid isotopy, each TXi + \ is obtained from TXi by a single positive stabi- 
lization or destabilization. 

Is the Transverse Markov Theorem really different from the Markov Theorem? Are there 
transversal knots which are isotopic as topological knots but are not transversally isotopic? 
To answer this question we take a small detour and review the contributions of Bennequin 
in 0. 

Why did topologists begin to think about contact structures, and analysts begin to think 
about knots? While we might wish that analysts suddenly became overwhelmed with the 
beauty of knots, there was a more specific and focused reason. At the time that Bennequin 
did his foundational work in [§] it was not known whether a 3-manifold could support 
more than one isotopy class of contact structures. Bennequin answered this question in the 
affirmative, in the case of contact structures on M 3 or which were known to be homotopic 
to the standard one. His tool for answering it was highly original, and it had to do with 
braids and knots. 

Let TK be a transversal knot. By the above, we can assume without loss of generality 
that it is a closed braid. Let TK be its transversal knot type, i.e., its knot type under 
transversal isotopies, and let [TK] top be its topological knot type. Choose a representative 
TX of TK, which (by Bennequin's transversal version of Alexander's theorem) is always 
possible. Choose a Seifert surface F of minimal genus, with TX = dF. Bennequin studied 
the foliation of F which is induced by the intersections of F with the plane field determined 
by £„-. Let n(TX) be the braid index and let e(TX) be the algebraic crossing number 
of a projection of TX onto the plane z = 0. Both can be determined from the foliation. 
Bennequin found an invariant of TK, given by the formula (3{TK) = e(TX) — n(TX). Of 
course if he had known Theorem the proof that (3{TK) is an invariant of TK would have 
been trivial, but he did not have that tool. He then showed a little bit more: he showed that 
(5{TK) is bounded above by the negative of the Euler characteristic of F in He then 
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showed that this bound fails in one of the contact structures £ >7r . In this way he proved 
that the contact structures £ >7r cannot be isotopic to 

To knot theorists, Bennequin's proof should seem intuitively natural, because the in- 
variant (3 (TIC) is a self-linking number of a representative TX £ TIC (the sense of push-off 
being determined by £), and the more twisting there is the higher this number can be. For 
an explanation of the self-linking, and lots more about Legendrian and transversal knots we 
refer the reader to John Etnyre's excellent review article The basic idea is that TX 
bounds a Seifert surface, and this Seifert surface is foliated by the plane field associated 
to £. Call this foliation the characteristic foliation . Near the boundary, the characteristic 
foliation is transverse to the boundary. The Bennequin invariant is the linking number of 
TX,TX', where TX' is is a copy of TX, obtained by pushing TX off itself onto F, using 
the direction determined by the characteristic foliation of F. 

Bennequin's paper was truly important. Shortly after it was written Eliashberg showed 
in [211] that the phenomenon of an infinite sequence of contact structures related to a single 
one of minimal twist angle occured generically in every 3-manifold, and introduced the term 
'tight' and 'overtwisted' to distinguish the two cases. Here too, there is a reason that will 
seem natural to topologists. In 2003 Giroux proved [75] that every contact structure on 
every closed, orientable 3-manifold M 3 can be obtained in the following way: represent M 3 
as a branched covering space of 5 3 , branched over a knot or link, and lift the standard and 
overtwisted contact structures on S 3 to M 3 . 

Returning to knot theory, the invariant (3(TIC) allows us to answer a fundamental ques- 
tion: is the equivalence relation on knots that is defined by transversal isotopy really differ- 
ent from the equivalence relation defined by topological isotopy? The Bennequin invariant 
will be used to answer this question in the affirmative: 

Theorem 8 9 There are infinitely many distinct transversal knot types associated to each 
topological knot type. 

Proof: Choose a transversal knot type TIC and a closed braid representative TXq. Sta- 
bilizing the closed braid TXq once negatively (recall the definition of positive/negative 
stabilizations given in N2.31 Figure UJ , we obtain the transverse closed braid TX\ , with 
e(TX x ) = e(TX ) - 1 and n(TX x ) = n(TX ) + 1, so that /3(TX t ) = f3(TX ) - 2. It- 
erating, we obtain transverse closed braids TX2 , TX3 , . . . , defining transverse knot types 
TIC 1, TIC2, TIC3, ■■ •, and no two have the same Bennequin invariant. Since stabilization 
does not change the topological knot type, the assertion follows. | 

At this writing, it is an open problem to find computable invariants of TIC which are 
not determined by \TK\tap and (3(TIC). A hint that the problem might turn out to be quite 
subtle was in the paper [72] by Fuchs and Tabachnikov, who proved that while ragbags 
filled with polynomial and finite type invariants of transversal knot types TIC exist, based 
upon the work of Arnold in [3], they are all determined by \TK\t ov and (3(TIC). Thus, the 
seemingly new invariants that many people had discovered by using Arnold's ideas were 
just a fancy way of encoding [T/C] top and (3(TIC). 

This leads naturally to a question: Are there computable invariants of transversal knots 
which are not determined by [TIC]t p and /3(TIC)? A similar question arises in the setting 
of Legendrian knots. Each Legendrian knot CIC determines a topological knot type [CIC]top, 
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and just as in the transverse case it is an invariant of the Legendrian knot type. There 
are also two numerical invariants of CIC: the Thurston-Bennequin invariant tb(CJC) (a self- 
linking number) and the Maslov index M(CK) (a rotation number). So until a few years ago 
the same question existed in the Legendrian setting, but the Legendrian case has recently 
been settled by Yuri Chekanov: 

Theorem 9 0Kj There exist distinct Legendian knot types which have the same topological 
knot type[CJC]top, and also the same Thurston-Bennequin invariant tb(CfC) and Maslov index 
M(jCJC). 

The analogous result for transversal knots proved to be quite difficult, so to begin to 
understand whether something could be done via braid theory, the first author and Nancy 
Wrinkle asked an easier question, which they answered in part in |34| : are there transversal 
knot types which are determined by their topological knot type and Bennequin number? 
This question lead to a definition: a transversal knot type TIC is transversally simple if it 
is determined by [TK] top and /3(77C). So the question is: are there transversally simple 
knots? The manuscript [31] gives a purely topological (in fact braid-theoretic) criterion 
which enables one to answer the question affirmatively, adding one more piece of evidence 
that topology and analysis walk hand in hand. To explain it, recall the destabilization and 
exchange moves, and the two flypes, depicted in Figure l20l A topological knot or link type 
K is said to be exchange reducible if an arbitrary closed braid representative X of K can be 
changed to an arbitrary representative of minimum braid index by braid isotopy, positive 
and negative destabilizations and exchange moves. We have: 

Theorem 10 [31] If a knot type K is exchange-reducible, then any transversal knot type 
TIC which has [TIC]top = K is transversally simple. 

This theorem was used to give a new proof of a theorem of Eliashberg jSUI , which asserts that 
the unlink is transversally simple, and also (with the help of |105| ) to prove the then- new 
result that most iterated torus knots are transversally simple. 

The rest of this section will be directed at explaining the main result of |31j : 

Theorem 11 [3J There exist transversal knot types which are not transversally simple. 
Explicitly, the transverse closed 3-braids TX + = o\o\o-\o~2 and TX_ = o~\o~2 o\o~\ deter- 
mine transverse knot types 77C + ,T/C_ with (T/C + ) top = (T/C_) top and (3(TfC + ) = (3(TK. ), 
butTIC+^TK-. 

Sketch of the proof of Theorem 111! See [31] for all details. The examples in Theorem 
1111 were obtained by choosing all the weights in the negative flype template to be 1, and 
assigning explicit 2-braids to the blocks P,Q,R of the negative flype template of Figure 
1201 If the weights are all chosen to be 1, the blocks P,Q,R are 2-braids and (except in 
very special cases) the flype templates have braid index 3. First one must show that the 
examples satisfy the conditions of the theorem. The topological knot types defined by the 
closed 3-braids o\a\o\o2 X and crfa^ 1 o~\a\ coincide because they are carried by the block 
strand diagrams for the negative flype template of Figure I2UI The Bennequin invariant can 
be computed as the exponent sum of the braid word (14 in both cases) minus the braid 
index (3 in both cases). So the examples have the required properties. 
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The hard part is the establishment of a special version of Theorem El which is applicable 
to the situation. Its special features are as follows: 

1. Both X + and X_ have braid index 3. 

2. Since it is well known that exchange moves can be replaced by braid isotopy for 
3-braids, the first two sequences in Theorem are vacuous, i.e. X± = X±. 

3. Because of the special assumption just noted, the templates that are needed, in the 
topological setting, can be enumerated explicitly: they are the positive and negative 
destabilization and the positive and negative hype templates. No others are needed. 

It is proved in [3^ that if X_ and X + are transversal closed braids TX + and TX_, 
then the isotopy that takes TX + to TX_ may be assumed to be transversal. So, suppose 
that a transversal isotopy exists from the transverse closed braid TX + to the transverse 
closed braid TX_. Then there is a 3-braid template that carries the braids afa^cfa^ 1 and 
a 1 02^o\o2- This is the first key fact that we use from Theorem El Instead of having to 
consider all possible transversal isotopies from TX + to TX_ , we only need to consider those 
that relate the left and right block-strand diagrams in one of the four 3-braid templates. So 
the braids in question are carried either by one of the two destabilization templates or by 
one of the two flype templates. If it was one of the destabilization templates, then the knots 
in question could be represented by 2 or 1-braids, i.e. they would be type (2, n) torus knots 
or the unknot, however an easy argument shows that the knots in Theorem ^2 are neither 
type (2, n) torus knots or the unknot. The positive hype templates are ruled out in different 
way: topologically, our closed braids admit a negative hype, so if they are also carried by 
the positive flype template they admit flypes of both signs. However, the manuscript 93 
gives conditions under which a closed 3-braid admits flypes of both signs, and the examples 
were chosen explicitly to rule out that possibility. 

We are reduced to isotopies that are supported by the negative flype template. It is 
straightforward to show that the obvious isotopy is not transversal, but maybe there is 
some other isotopy which is transversal. Here a key fact about the definition of a template 
is used (and this is a second very strong aspect of the MTWS). If such a transversal isotopy 
exists, then it exists for every knot or link defined by a fixed choice of braiding assignments 
to the blocks. Choose the braiding assignments erf , erf, erf 5 to the blocks P,R,Q. This 
braiding assignment gives a 2-component link L\ U L2 which has two distinct isotopy classes 
of closed 3-braid representatives. If L\ is the component associated to the left strand 
entering the block P, then (3{L\) = —1 and (3{L2) = —3 before the flype, but after the 
flype the representative will be afa^ 1 ^ 5 ^, with (3(Li) = —3 and (3{L2) = —1. However, 
by Eliashberg's isotopy extension theorem (Proposition 2.1.2 of 60J a transversal isotopy 
of a knot/link extends to an ambient transversal isotopy of the 3-sphere. Any transversal 
isotopy of L\ U L2 must preserve the /^-invariants of the components. It follows that no 
such transversal isotopy exists, a contradiction of our assumption that TX + and TX_ are 
transversally isotopic. | 

Other examples of a similar nature were discovered by Etnyre and Honda [01] after 
the proof of Theorem was posted on the arXiv. Their methods are very different from 
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the proof that we just described (being based on contact theory techniques rather than 
topological techniques), but are equally indirect. They do not produce explicit examples, 
rather they present a bag of pairs of transverse knots and prove that at least one pair in the 
bag exists with the properties given by Theorem 111! Therefore we pose, as an important 
open problem: 

Open Problem 4 Find new computable invariants of transversal knot types. Here 'new' 
means an invariant which is not determined by the topological knot type TX and and the 
Bennequin invariant [3{TX). 'Computable' means that it should be computable from either 
a closed braid diagram or some other representation of the transversal knot. Braid groups 
seem to be a natural setting for investigating this problem. £ 

Remark 3.1 The connections between closed braids and contact structures does not end 
with transversal knots. There are fundamental relationships between open book structures 
on 3-manifolds and contact structures on those manifolds, both untwisted and twisted. 
See [ZHl for an introduction to this interesting new area, and see [HSj for a review of the 
mathematics and a discussion of many open problems waiting to be investigated. 
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4 Representations of the braid groups 



Before the discovery of Hecke algbra representations of the braid group (discussed in this 
section) very little was known about finite dimensional but infinite representations of B„, 
except for the ubiquitous Burau representation. That matter changed dramatically in 1987 
with the publication of |89| . Suddenly, we had more knot invariants and with them more 
braid group representations than anyone could deal with, and the issue became one of 
organizing them. However, we shall not attempt to give a comprehensive overview of the 
rich theory of representations of braid groups in this section. Instead, we focus here on 
the representations of B„ which have played the greatest roles in the development of that 
theory: the Burau representation, the Hecke algebra representations, and, more recently, 
the Lawrence-Krammer representation. 

4.1 A brief look at representations of E n 

The fact that the representation theory of B„ is rooted in the representation theory of the 
symmetric group T, n is a consequence of the surjection B n — > S n; given by mapping the 
elementary braid U{ to the transposition Sj = (i, i + 1)- While the kernel of this homomor- 
phism is very big (it's the entire pure braid group P n ), it nevertheless turns out that a great 
deal can be learned about representations of B n by studying the collection of irreducible 
representations of S n , and attempting to lift them to reprentations of B n by 'deforming' 
them, and hoping that you get something new. For the record, we now note that the group 
algebra CS n has generators 1, si, . . . , s n _i and defining relations: 

SiSk = s k Si if \i — k\ > 2, SiS i+ iSi = s i+ \SiS i+ \, sf = 1, (13) 

where 1 < i ^ k < n — 1. As a vector space, CS n is spanned by n! reduced words in 
the transpositions sf. 

{{Si 1 Si 1 — \ . . . Sj 1 _fc 1 )(Sj 2 Sj 2 _l . . . Sj 2 _fc 2 ) • • • (Si r Xi r — l . . . Sj r _fc r )} (14) 

where 1 < ii < 12 < ■ ■ ■ < i r < n — 1 and ij — kj > 1. 

Irreducible representations of T, n are parametrized by Young diagrams, which in turn 
are parametrized by partitions of n. (See |73j for one of the many good discussions of 
this subject in the literature.) A Young diagram consists of stacked rows of boxes, aligned 
on the left, with the number of boxes in each row strictly nonincreasing as you go from 
top to bottom. The number of boxes in a row corresponds to a term in a given partition 
of n. For example, the Young diagram shown in Figure E21 corresponds to the partition 
8 = 4 + 2 + 1 + 1. 



Figure 22: The Young diagram corresponding to the partition 4 + 2 + 1 + 1. 
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As it turns out, the irreducible representations of the symmetric group are in 1-1 cor- 
respondence with Young diagrams, or equivalently with partitions of n. There is a row- 
column symmetry, and we adopt the convention that the Young diagram which has one 
row of n boxes, which comes from the partition n = n, corresponds to the trivial repre- 
sentation. Then the Young diagram which has n rows consisting of 1 box per row, which 
comes from the partition n = 1 + 1 + • • • + 1, corresponds to the parity representation, 
mapping each element of T, n to its sign. The first interesting representation of a braid 
group occurs when n = 3, corresponding to the partition 3 = 2 + 1. More generally, we 
consider the partition n = (n — 1) + 1, or the Young diagram with two rows, the top row 
having (n — 1) boxes and the bottom having a single box. This Young diagram corresponds 
to the standard representation S n — > GL n (C), given by the usual action of S n on C n by 
permuting basis vectors. Thus if Si denotes the transposition {i i + 1) € S n , the standard 
representation sends Si to the following: 



where Ik denotes the k x k identity matrix. 

The standard representation is reducible; it is easy to see that it fixes a 1-dimensional 
subspace, namely, the span of the sum of the basis vectors. The complementary (n — 1)- 
dimensional subspace is the set of all points (zi, . . . , z n ) 6 C n such that Z\ + . . . + z n = 0. 
This subspace corresponds to the irreducible representation given by the Young diagram in 
question. One can use the hook length formula to calculate directly that the dimension of 
the representation corresponding to this Young diagram is indeed n — 1 (we refer the reader 
again to [ZS] for an explanation of this formula) . 

4.2 The Burau representation and polynomial invariants of knots. 

Burau first introduced his representation of the braid group in 1936 [32j. Much later, it 
was realized that it could be thought of as a deformation of the standard representation 
of corresponding to the partition n = (n — 1) + 1. For many years it was the focus of 
the representation theory of braid groups. We define the Burau representation p : B n — > 



Note that substituting t = 1 gives back the representation (of B n factoring through S n ), and 
this is why we say that it is a deformation of the standard representation of S n . Like the rep- 
resentation of S n , the Burau representation splits into a 1-dimensional representation and an 
(n — 1) -dimensional irreducible representation known as the reduced Burau representation 
which we denote by p : B n — > GL n (Z[t, i -1 ]) as follows: 




GL n (Z[M -1 ]) as follows: 




cn >-» h-2 ffi -t I, 




n—i—2 



where the —t in the middle of the 3x3 matrix is always in the (i, i) th spot. 
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It has been known for a long time that the Burau representation is faithful for re < 3 
(see ^Hl> f° r example), and for many years the representation was held to be a reasonable 
candidate for a faithful linear representation of B n for all n. However, in 1991, Moody 
showed that for n > 9, the Burau representation is not faithful |1(J7| . Long and Paton later 
improved Moody's result to n > 6 |ll)3j . Bigelow further improved this to n > 5 |13| . At 
the time of this writing, the case re = 4 remains open. 

Despite such results, the Burau representation continues to play an important role in 
the study of representations of B n . It was known classically that the Alexander polynomial 
Ajc(t) of a knot or link K can be calculated directly from the image under the reduced 
Burau representation of a braid X such that b(X) represents K, as follows (see [T%] . e.g., 
for a proof based upon Theorem |1J the Markov Theorem): 

det(p(X) - J n -x) 

A »W(*)= 1+f + ... +t n-l - (15) 

Thus the Alexander polynomial of the closed braid associated to the open braid X, i.e. 
Af,rx){t), is a rescaling of the characteristic polynomial of the the image of X in the reduced 
representation. In what follows, we shall see how a property of the Burau representation 
motivated the definition of a variant on the two- variable HOMFLY polynomial [73], a knot 
invariant of which both the Alexander polynomial and the Jones polynomial are specializa- 
tions. Further, in <J4.4I we shall give a topological interpretation of the Burau representation 
which naturally leads to the definition of a faithful linear representation of B n known as 
the Lawrence-Krammer representation, which will be our focus in H4.51 



4.3 Hecke algebras representations of braid groups and polynomial in- 
variants of knots 

A simple calculation, together with the Cayley-Hamilton theorem, shows that the image 
of each of our braid group generators under the Burau representation, p(o~i), satisfies the 
characteristic equation x 2 = (1 — t)x + t and thus has two distinct eigenvalues. This 
prompted Jones to study all representations p : B n — > GL n (C) which have at most two 
distinct eigenvalues [SH]- Let Xi = p{o~i). Then for all i, X{ must satisfy a quadratic equation 
of the form x\ + axi + 6 = 0. By rescaling, we may assume that one of the eigenvalues 
is 1 and eliminate one of the variables, e.g., a = —(1 + 6). Note that by rewriting our 
quadratic equation and making the substitution 6 = — t we regain the characteristic equation 
from the Burau representation. However, the convention in the literature seems to be to 
rescale our representation by (—1) so that the equation takes the form x\ = (t — l)xi + t. 
With this motivation, we define the Hecke algebra H n {t) to be the algebra with generators 
1, xi, . . . , x n -\ and defining relations as follows: 

XiX k = x k Xi if \i - k\ > 2, XiXi+iXi = x i+1 XiX i+ i, xf = (t - l)x; + t, (16) 

where 1 < i ^ k < n—1. Comparing the relations in (|13|) and (|16j) . we see that H n (l) = CS n , 
the group algebra of the symmetric group. Hence we can think of H n (t) as a 'deformation' 
of CE n . 

The connection between H n (t) and CT, n is made even more transparent by noting that as 
a vector space, H n (t) is spanned by re! lifts of a system of reduced words in the transpositions 
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Si S S n . For example, we can take as a spanning set 

{(X{ 1 Xi 1 — i . . . Xi 1 — k 1 ^(xi2Xi 2 — l • • • Xi 2 — fc 2 ) ' ' ' (%i r %i r — 1 • • • -^ir— A; r )} (l^) 

where 1 < ii < ^2 < • • • < i r < n — 1 an d ij — kj > 1 |35j . |89| . 

Remark 4.1 In ^1.4.2l we denned an algebra J ra (i) with generators 1, gi , . . . , g n -i an d defin- 
ing relations 0, the Jones algebra. As it turns out, its irreducible summands are in 1-1 cor- 
respondence with the irreducible representations of the Hecke algebra that are parametrized 
by Young diagrams with exactly 2 rows. The Hecke algebra, as defined above, has gener- 
ators maps l,a;i, . . . ,x n -i with defining relations (|16|). so that the map £ : H n (t) — ► J n (t) 
that is defined by £(x«) = g% is a homorphism of algebras. 

Our main purpose in this section is to outline Jones' development in |55] of a two- variable 
polynomial knot invariant arising from representations of the Hecke algebras H n (t). This 
polynomial is essentially the well-known HOMFLY polynomial, and includes the Jones 
polynomial as a specialization. We have just seen that the Jones algebra is the quotient of 
the Hecke algebra H n (t) by one extra relation (as an aside, we note that this extra relation 
is satisfied by the image of the transpositions generators S{ = (i, i + 1) in S n under all 
representations arising from Young diagrams with at most two rows.) To pick up a second 
variable, we introduce an additional parameter by allowing a 1-parameter family of traces 
on Hecke algebras. We now pursue this point of view, and we also refer the reader to [S3] 
for another exposition which follows the same point of view. 

We begin by defining a function / : B n — ► H n (t) by /(crj) = Xj. The function / is well- 
defined on reduced words in the generators <7j and commutes with the natural inclusions 
B n _i C B n and H n _\(t) C H n (t), although in general / fails to be a homomorphism. We 
can then apply the following result due to Adrian Ocneanu which appeared in [J^j and was 
proved inductively in jSHj using the n!-element basis given above. 

Theorem 12 [7S], [SH] For each z G C* (and each t S C*^, there exists a unique trace 
function tr : U^ =1 H n (t) — > C such that 

1. tr(l) = 1 

2. tr(afe) = tr(ha) 

3. tr is C-linear 

4- tr(nx n _iu) = z tr(uv) for all u,v £ H n —i(t). 

Theorem^Jgives us a one-parameter family of trace functions on a one-parameter family 
of algebras. In fact, using the properties of the trace function given in theorem it is possible 
to compute tr(/(X)) for all X £ B n . (We note the fact that for any w £ H n (t) such that 
w ^ H n _i(t), there is a unique reduced word w = X{ x ■ ■ ■ Xi r in which x n _i appears exactly 
once jHS]-) In practice, the third relation of H n (t) is quite useful for computing the trace 
function tr, both in its original form and in the following: 

x^ 1 = t~ 1 x l + (t- 1 - 1). 
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Example 4.1 Let X± = erf £ B2, and let X2 = cria^ 1 cricr^ 1 £ B3. Note that Xl is the 
right-handed trefoil knot and that X2 is the Figure-8 knot. We invite the reader to check 
that 

tv(f(X 1 )) = (t 2 -t+l)z + t(t-l) 

and that 

tr(/(X 2 )) = (3 - r 1 - t)t- l z 2 + (3 - r l - t){r l - i)z - (2 - r l - 

We also note that the second property of the trace function given in Theorem 1121 implies 
that tr o f is invariant on conjugacy classes in B n . It remains to tweak the function a 
bit in order to obtain from a given braid a two-variable polynomial which is also invariant 
under stabilization and destabilization moves as defined in H2. 3| such a polynomial will be 
Markov-invariant and hence an invariant of the knot type of the closed braid. 

Algebraically, stabilization and destabilization each take the form X — > Xa^ 1 , the 
only difference being appropriate conditions on the braid X. We would like to rescale our 
representation / in such a way that both versions of stabilization (resp. destabilization) 
have the same effect on the trace function. Suppose there exists a complex number k such 
that tr(/cxj) = tr((/cxj) -1 )). Then we can find a 'formula' for k as follows: 

k 2 ti(xi) = tr(x~ 1 ) 

k 2 z = tr(t _1 Xi + r 1 - 1) 
t~ x z + t- x - 1 



k 2 



Solving this for z, we obtain 



z 

1 + z - t 
tz 

l-t 



l-k 2 t 

We set k = k 2 , and define f K : B n — > H n (t) by = y/ii o%- Now we have 

tr(/ K (cr n )) = \[k z 

r- 1-* 

= -VK- T- 

1 — Kb 

We would like to define a map B n — > Z[t , k^ 1 ] which is Markov invariant. At the 
moment, we have that 

tr(/(™ • v^7n)) = -v^— ^tr(/(«;)) = tr(/(w • ^a~ 1 )) 

L — Kt y/t 

for any w £ B„. Now we simply define 

1 1 — Kt 

F(X)=F x (t,«) = (_L . i_5)»-itr(/„(JT)) 
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for X £ B n , where E is the exponent sum of X as a word in a±, . . . , cr n -i- It is clear that 
F{X) depends only on the knot type of b(X). We now reparametrize one last time, setting 

I = \f~K\Tt 

m = Vt -p. 

y/i 

With this substitution, we obtain a Laurent polynomial in two variables I and m, which 
we denote P^x)(l,'m) = P^(Z,m), where K is the (oriented) knot or link type of b(X). 
Furthermore, Pft-(Z,m) satisfies the skein relation 

mP Ko = l~ l P K+ - IP K _ (18) 

where Kq, K + , and K_ are oriented knots with identical diagrams except in a neighborhood 
of one crossing, where they have a diagram as given in Figure [231 (Proposition 6.2 of [EH])- 
Thus by beginning with Pjj = 1, where U denotes the unknot, it is possible to calculate Pk 
for any knot or link K using only the skein relation, which is often simpler than using the 
trace function. 

v V ^ 

/\ /\ r\ 

Figure 23: Crossings for K + ,K_, and Kq, respectively, in the skein relation given in (|18jl. 



Remark 4.2 We note that the polynomial P^-(/,m) obtained in this way is essentially 
the same as the two- variable polynomial known as the HOMFLY polynomial ([ZE]) which 
is usually reparametrized as Pk[}1~ 1 -, im). 

Example 4.2 Let X±, Xi be the braids defined in Example 14.11 whose closures are K\ = 
the right-handed trefoil knot and K2 = the Figure-8 knot, respectively. We leave it as an 
exercise for the reader to check that 

F Xl (t, k) = k(1 + t 2 - Kt 2 ) 

= Kt(t + t" 1 - Kt) 

= nt(2 - nt + 1 + r l - 2) 

1 ^2 



Kt 2 - Kt + (Vt 



and hence we have 

P Kl (l,m) =2/ 2 -/ 4 + / 2 m 2 . 
Similarly, the reader can check that 



2+2 



, N 1 - k(1 -t + t 2 ) + kH 
Fx 2 {t,n) = ^ >- 

tK 

P K2 (l,m) = r 2 -m 2 -l + l 2 . 
For explicit calculations of Fx 2 and Pk 2 using the trace function, see p. 350 of 89 .4 
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Remark 4.3 There is also a 1-variable knot polynomial, the Jones polynomial, associated 
to the algebra J n (t) generated by 1, gi, ... , g n -i with defining relations (J7J). In the situation 
of the Jones algebra the trace is unique, whereas in the situation of the Hecke algebra, as we 
presented it here, there is a 1-parameter family of traces. The 1-variable Jones polynomial 
was discovered before the 2-variable HOMFLY polynomial. 

This two- variable knot polynomial has been much studied and reviewed in the literature. 
For the sake of completeness we list here a few of its noteworthy properties and applications. 

1. Connect sums: Pkx#k<2 = Pk 1 ■ Pk 2 

2. Disjoint unions: P Kl UK 2 = { L ~^ ]l )Pk 1 • Pr 2 - 

3. Orientation: = Px, where K denotes the link obtained by reversing the orien- 
tation of every component of the link K. 

4. Chirality: Pf>(l,m) = Pk{1 m), where K denotes the mirror image of the link 
K. 

5. Alexander polynomial: Note that Fx(1,k) is not defined. It comes as something 

of surprise, then, that the specialization I = l,m = y/i \= gives the Alexander 

vt 

polynomial A^(t). Jones shows how to avoid the singularity by exploiting an al- 
ternate method of calculating the trace function using weighted sums of traces (see 
|132| and j^U] as well as [73] and [EH])- A by-product of this alternate method is 
another derivation of Equation El showing how to calculate A^(£) from the Burau 
representation. 

6. Jones polynomial: The famous Jones polynomial can be obtained from the two- 
variable polynomial by setting 

v K {t) = p{t,Vt-\). 

v* 

Note that we are abusing notation by reusing the variable t here and above in A#-(t). 

7. A lower bound for braid index At the end of ^12. 21 we remarked that it is an 
open problem to determine the braid index of a knot algorithmically. However, the 
HOMFLY polynomial does give a remarkably useful lower bound, via a famous in- 
equality which is known as the Morton-Franks- Williams inequality. It was proved 
simultaneously and independently by Hugh Morton in jlllj and by John Franks and 
Robert Williams in |74| . While it proved to be sharp on all but 5 of the knots in the 
standard tables of knots having at most 10 crossings, there are also infinitely many 
knots on which it fails to be sharp. The first author came to a new appreciation of 
the importance of this problem when she was faced with the problem of determining, 
precisely, the braid index of certain 6-braid knots on which it failed. Consulting many 
people, it became apparent that there was essentially no other useful result on this 
basic question. 
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4.4 A topological interpretation of the Burau representation 

We have seen that the Burau representation is deeply connected with the topology of closed 
braids. We shall now give a fully topological definition of the Burau representation. We 
outline here ideas given in |13| : for full details see that or Turaev's excellent survey article 
|13U| . among others. 

Let D n be an n-punctured disk, which we will think of as a disk D with n distinguished 
points qi, . . . , q n . Choose a point do £ dD n to serve as the basepoint. Then 7Ti (D n , do) is free 
on n generators which can be represented by loops x% based at do travelling counterclockwise 
about the puncture % for i = 1, . . . , n. We define a surjective map e : ivi(D n , do) — > Z as 
follows. Let 7 = x™ 1 • • • x™ r G iri(D n , do)- Then we define the exponent sum e{pf) = J2l=i n i- 
The integer e(j) can be interpreted as the total algebraic winding number of 7 about the 
punctures {qi}, i.e., the sum over alH = 1, . . . , n of the winding number of 7 about qi. Now 
there is a regular covering space D n of D n corresponding to the kernel of the map e. Since 
e : iri(D n ,do) — > Z is surjective, the group of covering transformations Aut(£)„) = Z. Let r 
be a generator of Z, and let A = Z[i, f ]. Then H\{D n ) inherits a A-module structure from 
the action of the covering transformations: we simply set t ■ 7 = ^(7), where 7 S H\(D n ) 
and r* denotes the induced action on homology. We note that as a A-module, H\(D n ) is 
free of rank n — 1. 

In what follows it will be convenient to think of B n as in 31.31 i.e., as the mapping class 
group of D n where dD n is fixed pointwise, while the punctures may be permuted. We shall 
abuse terminology by not distinguishing between a mapping class and a diffeomorphism 
which represents it. Therefore we think of any X € B n as a map X : D n — > D n . Then X 
lifts uniquely to a map X : D n — * D n which fixes the fiber over the basepoint do pointwise. 
Furthermore, X induces a A-module automorphism X* of H\(D n ). Since H\(D n ) is a free 
A-module of rank n — 1, we can now define a map B n — ► GL n _i(A) by X 1— > A*. This map 
turns out to be equivalent to the reduced Burau representation defined previously (see [HE] 
for a classification of linear representations of the braid group of degree at most n — 1). 

The main idea of Stephen Bigelow's proof of the non-faithfulness of the Burau represen- 
tation in the case n = 5 is contained in the following theorem: 

Theorem 13 ^5] For n > 3, the Burau representation p : B n — * GL n _i(A) is not faithful 
if and only if there exist arcs a, (5 embedded in D n satisfying: 

1. da = {q 1 ,q 2 } and d(3 = {d , q 3 } or{q- i ,q 4 \. 

2. a intersects (3 nontrivially (more precisely, there exists no isotopy rel endpoints which 
carries a off 0) 

3. For some choice of lifts a,/3, we have Ylkezi^&i @)t k = 0, where (x,y) denotes the 
algebraic intersection number of two (oriented) arcs in D n . 

We note that the case d(3 = {do,q%} follows from Theorem 1.5 of |103| . Bigelow has 
produced an explicit example of arcs a and (3 satisfying the criteria of Theorem ED i n the 
case n = 5 (see p. 402 of ^H])- It follows that the Burau representation of B n is not faithful 
for n > 5. It has been known for many years that it is faithful for n = 3. 

Open Problem 5 Is the Burau representation of B4 faithful? 4k 
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In the next section, we will explore another representation of B n , with a topological 
interpretation analogous to that of the Burau representation. 

4.5 The Lawrence-Krammer representation 

In 1990, Ruth Lawrence introduced a family of representations of B„ corresponding to 
Young diagrams with two rows which arise out of a topological construction of representa- 
tions of the Hecke algebras H n {t) jlUUj . Later, Krammer gave an entirely algebraic definition 
of one of these representations: 

A : B„ , -> GL r (A) 

where A = Z[£ , q ] and showed that it was faithful for n = 4 |97j . The representation 
A has become known as the Lawrence-Krammer representation. Shortly after Krammer's 
result appeared, Bigelow was able to use topological methods to show that A is faithful for 
all n |14j . Krammer later gave an algebraic proof of the same result [HE] • Therefore we now 
have: 

Theorem 14 [TI|, [55] The map A is a faithful representation of~B n , and hence B n is a 
linear group for all n. 

We observe that Krammer's result in |97| also implies that Aut(i<2), the group of automor- 
phisms of a free group of rank 2, is linear. 

We shall not present here a full proof of Theorem I14| our aim is to define the repre- 
sentation A, to present the key ingredients in Bigelow's proof: 'forks', 'noodles', and the 
pairing between them, and to describe Bigelow's characterization of the kernel of A in terms 
of these ingredients. 

As mentioned previously, Bigelow's topological definition of A is somewhat analogous to 
the topological definition of the Burau representation given in the previous section. This 
time, we begin the construction with a certain configuration space of a punctured disk 
rather than the punctured disk itself. In the notation of £11.11 we let C = Co^(D n ), where 
D n is the n-punctured disk described in the previous section. Elements of C will be denoted 
{z\,Z2} = {z2,zi}, where Z\ / zi £ D n . Now we choose two distinct points di,d,2 £ dD n , 
and let cq = {cii,^} be the basepoint of C. We are now in a position to define a map 
4> : 7Ti(C, Co) — > Z x Z as follows. Let [7] € 7Tl(C, cq). Then the loop 7(5) in C can be 
expressed as 7(5) = {71 (s), 72(5)}, where 7» is an arc in D n for i = 1,2. 

We define two integers a, b as follows. 

1 f dz f dz 

27ri f^Jii z - 1i J 12 z ~ <U 

i_ r dz 

'•'71—72 

Since 7 is a loop in C, we either have that both 71 and 72 are both loops, i.e., 71 (0) = 71(1) 
and 72(0) = 72(1), or else, as Bigelow puts it, the 7« are arcs which 'switch places', i.e., 
7i(l) = 72(0) and 72(1) = 7i(0). In the first case, Bigelow observes that we can interpret 
a as the sum of the total algebraic winding numbers of 71 and 72 about the punctures qi 



a = 
b = 
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and b as twice the winding number of 71 and 72 about each other. In the second case, the 
product 7172 is a loop, and so we still have a nice interpretation of a as the total algebraic 
winding number of 7172 about the punctures Bigelow also points out that in this case 
the condition 71 — 72XI) = — (71 — 72) (0) implies that b is an odd integer. It is worth 
noting that Turaev |13Uj interprets the integer 6 in a different way (in both cases). The 
composition of the map 717 (C) — ► S 1 defined by 

^ 7i(g) ~ 72(g) 
l7i(»)-72(s)| 

with the usual projection from S 1 onto MP 1 sends our loop 7 to a loop in MP 1 . Let 7 denote 
the homology class of this loop in Pi (MP 1 ). Up to a choice of generator u for Pi (MP 1 ) = Z, 
we have 7 = u b . We define the map <fi by setting 4>{l) = Q a t £ Z x Z, thought of as the 
free abelian group with basis {q, t}. 

Now we proceed more or less as in the case of the Burau representation. Let C be the 
regular cover of C which corresponds to the kernel of <j> in 717 (C, cq). Let A denote the ring 
Z[i ±:L , q ]. The homology groups of C naturally inherit an A- module structure via the 
action of Aut(C). 

It is clear that any homeomorphism X : D n — > D n induces a homeomorphism X' : C — > 
C defined by X'({zi, Z2} = {X (zi) , X (Z2)} ■ Thus X' necessarily fixes the basepoint cq. 
If X^ denotes the induced action on 717 (C), then it is an easy exercise to check that X^ 
preserves the values of the integers a and b defined above. Then X' lifts uniquely to a map 
X : C — ► C such that the fiber over the basepoint cq is fixed pointwise. Moreover, since X 
commutes with the action of Aut(C), we have that the induced action on H2(C), denoted 
X*, is an A-module automorphism. (Note that C is a 4-manifold.) Now H2{C) is a free 

A-module of rank r = f ™ ~\ (Theorem 4.1 of |14j ) and thus we can think of X* G A. We 

therefore define the Lawrence-Krammer representation as follows: 

A : B n -> GL r (vl) 
X h-> X* 

If we choose q and t to be algebraically independent in C, we obtain a faithful representation 
A : B n -> GL r (C). 

Recall that di,d,2 = Co is the basepoint we have chosen for the configuration space C. 
A fork is a tree F embedded in the disk D such that 

1. F has four vertices: d%,z,qi and qj, 

2. the three edges of F share z as a common vertex, 

3. FndD n = di, and 

4. Fn{qx,...,q n } = {qi,qj}- 

We note that Krammer's algebraic definition of the representation A depends on the induced 
action on an A-module generated forks. 



54 



The edge of F which contains the vertex d\ is called the handle of F. The union of 
the other two edges is called the tine edge of F, denoted T(F), and may contain punctures 
qk,k / i, j. For any fork F we define a parallel copy F' to be a copy of the tree F embedded 
in D, with vertices {d,2,z',qi and qj such that d\ / d 2 £ dD n , z ^ z', and F' is isotopic to 
F rel Given a fork F, we also define a noodle to be an arc N properly embedded 

in D n with dN = {d±, d 2 } and oriented so that its initial point is d\ and its terminal point 
is d 2 . 

We next construct surfaces in C associated to forks and noodles with which we will 
define a certain pairing on forks and noodles. For each fork F, we choose a parallel copy 
F' and first define a surface S(F) as the set of all points of the form {x, y} £ C with 
x £ T{F) \ {q%, . . . ,q n } and y 6 T(F') \ {q±, . . . ,q n }- Let ( (resp. (') denote the handle 
of F (resp. F'), oriented from d\ to z (resp. d 2 to z'). Define C to be the unique lift of 
{C(s), C'( s )} ^ C with initial point do, a fixed point in the fiber over cq. Now let S(F) C C 
be the unique lift of S(F) which contains the terminal point of £. Similarly, we can associate 
to each noodle N a surface S(N) C C. Define S(N) C C as the set of all points of the 
form {x, y} G C such that x,y £ N,x / y, and let S(N) be the unique lift of S(N) which 
contains cq. 

Given a noodle and a fork F, we define their pairing, denoted (N,F), to be an 
element of the ring A as follows. We can assume that (up to isotopy rel endpoints) T(F) 
and iV intersect transversely in a finite number of points {zi, . . . , z r } and similarly that 
T(F') intersects N transversely at {z[, . . . ,z' r }. Then each pair Z{,z\ cobounds an arc in 
N which lies between T(F) and T(F'). Now for each i,j = l,...,r, there is a unique 
monomial rriij = q ai 'H bi >i such that rriijS(N) intersects the surface S(F) at a point in the 
fiber over Zi,z'j £ C. Letting ejj denote the sign of that intersection, we define 

r r 

{n,f) = Y j Y. € ^ hK3 - 

t=l j=l 

Bigelow supplies a method for explicit calculation of the pairing for a given noodle and fork 
and shows that it is well defined on isotopy classes of forks rel qi,qj. 

The proof that A is faithful relies on two important lemmas (the 'Basic Lemma' and 
'Key Lemma' of 14 ]), which we present here. As usual, we shall abuse notation and not 
distinguish between a mapping class and specific representatives. 

Lemma 4.1 14 If X(X) = 1, then (N,F) = (N,X(F)) for every noodle N and every fork 
F. 

Lemma 4.2 |14j For a noodle N and a fork F, the pairing (N, F) = if and only ifT(F) 
can be isotoped off N relative to qi,qj- 

Thus it is the pairing on noodles and forks which gives the essential characterization of the 
kernel of A. 

We end this section by noting two facts about the Lawrence-Krammer representation. 
First, Budney has shown that A is unitary 1 for an appropriate choice of q and t (still 

1 Squier had previously shown that the Burau representation is unitary |125| . 
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algebraically independent) jlj. In other words, we have 

A : B n - U r (C) 

where U r = {X £ GL r (C)|X T = X" 1 }. Thus the conjugacy class of X(X) G l7 r (C) is 
determined by its eigenvalues, and one could hope for an efficient solution to the conjugacy 
problem for B n by passing to the whole of U r (C). However, Budney has given examples of 
non-conjugate braids Xi,X 2 G B n such that A(Xl) and X(X 2 ) are conjugate in U r (C) (see 
Section 4 of |41j). We will return to these examples in a bit more detail in <j5.7l 

Second, Matthew Zinno has shown a connection between the Lawrence-Krammer repre- 
sentation and the Birman-Murakami-Wenzl (BMW) algebra. The BMW algebra is related 
to Kauffman's knot polynomial and can be thought of as a deformation of the Brauer al- 
gebra in the same way that the Hecke algebra can be thought of as a deformation of the 
group algebra CS n (see [33], |114j ). Braid groups map homomorphically into the BMW 
algebra, giving rise to irreducible representations of B„. Zinno has identified a summand 
of the BMW algebra which corresponds exactly to the Lawrence-Krammer representation: 

Theorem 15 |137j The Lawrence-Krammer representation ofT5 n is equivalent to the (n — 
2) x 1 irreducible representation of the BMW algebra. 

It follows immediately from Theorem El that the Lawrence-Krammer representation A is 
irreducible and also that the regular representation of the BMW algebra is faithful. 

4.6 Representations of other mapping class groups 

The first theorem that we proved in this review article was Theorem^ which asserted that 
the braid group B n _i has a natural interpretation as the mapping class group A^o,i,n-l of 
the n-times punctured disc. One wonders, then, whether the linearity of B n extends to a 
more general statement about the linearity of other mapping class groups M.g t b,n^ 

The manuscripts of M. Korkmaz j^H] and of S. Bigelow and R. Budney ^1 exploit the 
very special connection between the braid group B n _i = .Mo,i,n-i and the mapping class 
group -Mo,o,n 5 and between the mapping class groups -Mo,o,2c/+2 and M.g t o,o, to produce 
failthful finite dimensional representations of A^o,o,n for every n, also of A^2,o,0! and finally 
of a particular subgroup of .M 9i o,o fo r every g > 3. In the case of .M2,o,o> the basic fact used 
by both Korkmaz and Bigelow-Budney is a theorem proved by Birman and Hilden in 1973 
(see |18j). which asserts that the mapping class group M.2,0,0 is a Z2 central extension of 
-Mo, 0,6- This theorem is special to genus 2, which is the only group among the mapping class 
groups -M 9j o,o which has a center. For g > 3 the so-called 'hyper elliptic involution' which 
generates the center of M.2,0,0 generalizes to an involution whose centralizer is a subgroup 
of infinite index in -M 9 .o,o- The same circle of ideas yield faithful matrix representations of 
those subgroups. 

We now explain how Bigelow and Budney used the Lawrence-Krammer representation 
of the braid groups to obtain faithful matrix representations of A^oo,n.- If one considers 
any surface S g ^^ n and caps one of the boundary components by a disc, one obtains a 
geometrically induced disc-filling homomorphism d± : M.g t b, n -i Mg,b-i,n- Its kernel is the 
Dehn twist about the distinguished boundary component (see §2.8 of [EH]); a distinguished 
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point in the interior of the new disc becomes the new fixed point. Applying these ideas to 
the braid group, one then sees that there is a natural homomorphism d* : B n _i — ► M.o,o,n- 
It turns out that image(d lt ) is not the full group Mo,i,m but the stabilizer of the new 
fixed point, and kernel(d*) is the infinite cyclic subgroup of B n _i that is generated by the 
braid C = {p n -\ G n~'i ■ ■ ■ <J 2 cr i) n i a full twist of all of the n braid strands. Since the image 
A(C) in the Lawrence-Krammer representation is a scalar matrix (the diagonal entries are 
g,2(n-i)^2 j n ^ e re p resen tation as it is defined in JI]), one obtains a faithful representation of 
image(d*) by rescaling the Lawrence-Krammer matrices, setting t~ 2 = g 2 ( n_1 ). The group 
image((i*) is of finite index in A^o,o ni which therefore is a linear group. The dimension of the 

f n - 1 \ 2 

explicit representation of A4o,o,n constructed in ^S] is n ( j . It leads to a related 

representation of dimension 64 of A^2,o,o- I n this regard we note that, while Korkmaz uses 
the identical geometry, he uses less care with regard to dimension, and his representation 
of 7^2,0,0 has dimension 2 10 3 5 5 3 , which is very much bigger than 64. 

Open Problem 6 Is there a faithful finite dimensional matrix representation of the map- 
ping class group M. g fi,n f° r an Y values of the triplet (g,b,n) other than (0, 1, n), (0, 0, n), 
(2, 0, 0), (1, 0, 0) and (1, 1, 0)? A folklore conjecture is that most of the mapping class groups 
are, in fact linear. See [2Jj for evidence in this regard. New ideas seem to be needed to 
construct candidates. A 

In view of the very large dimension of the representation of A^.o.o which we just dis- 
cussed, we note that there is an interesting 5-dimensional representation of the same group 
which occurs as one of the summands in the Hecke algebra representation of B n , namely 
the one belonging to the Young diagram with 2 rows and 3 columns. It is discussed in |89| . 
At this writing its kernel does not seem to be known, although it is known to be infinite. 



4.7 Additional representations of B„. 

We end our discussion of representations of the braid groups by describing a construction 
which yields infinitely many finite dimensional representations of B n , each one over a ring 
C[ti, t± , t2, ^2 > ■ ■ ■ j^k, tfc 1 ], where t\, . . . are parameter, for some k > 1. The construc- 
tion includes all the summands in the Temperley-Lieb algebra and the Lawrence-Krammer 
representation too, and in addition infinitely many presumably new faithful representations 
of B„. It was first described in [2E], generalizing ideas in It is due to Moody, with 

details first worked out by Long in |lU2j . 

We will be interested in the braid group B n on n-strands, but to describe our con- 
struction it will be convenient to regard B n as a subgroup of B n+ i. Number the strands 
in the latter group as 0,1,..., n. Let B 1>n C B n+ i be the subgroup of braids in B n+ i 
whose associated permutation fixes the letter 0. Its relationship to B„ is given by the group 
extension 

1 - F n - Bi, n - B n - 1, (19) 

where the homomorphism Bi ; „ — > B n is defined by pulling out the zero th braid strand. 
There is a cross section which is defined by mapping B n to the subgroup of braids on 
strands 1, . . . ,n in Bi „. Therefore we may identify Bi n with F n xB n . The semi-direct 
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product structure arises when we regard B n as a subgroup of the automorphism group of 
a free group. The action of B n on F n is well known and is given in also |65| . and also 
in ^Hj. Thinking of F n as the fundamental group it\{SQ^ n ) of the n-times punctured disc, 
the action of the elementary braid cr, is given explicitly by: 



Since B n is a subgroup of B n+ i we see In this way that the groups B ra , F n and also F n xB„ 
are all subgroups of B n+ i. 

In order to describe the idea behind the construction we recall the notion of homology 
or cohomology of a space with coefficients in a flat vector bundle. Suppose that X is a 
manifold and that we are given a representation p : n\{X) — > GL(V). This enables us to 
define a flat vector bundle E p : Let X be the universal covering of X. The group tti(X) 
acts on X x V by g.(x, v) = (g.x, p(g).v). Then E p is the quotient of X x V by this action. 
We now form the cohomology groups of 1-forms with coefficients in E p , denoting these by 
H 1 (X;p) or H~(X;p) for compactly supported cochains. In order to get an action of the 
braid groups, we use (|20[). The action gives a canonical way of forming a split extension 
F n xB n . It turns out that in order to get an action on the twisted cohomology group what 
is required is exactly a representation of this split extension. Since Bi n is a subgroup of 
B n+ i, any representation of the latter will of course do the job, and that is why we think 
of B n as a subgroup of B n+ i . 

Theorem 16 [25] Given a representation p : F n xB„ — ► GL(V) we may construct another 
representation pf : B n — > i?^(5o,i, n ; p) where t is a new parameter. In particular, given 
any representation p : B n+ \ — > GL(V), we may construct a representation pf : B n — ► 
H c (So,l, n } P)- 

This works in exactly the way one might expect. The representation restricted to the free 
factor gives rise to the local system on the punctured disc and thus the twisted cohomology 
group and the compatibility condition provided by the split extension structure gives the 
braid group action. 

A comment is in order concerning Theorem 1161 Although the theorem is stated ab- 
stractly, (12(H) gives a concrete recipe which enables one to write down the description of pf 
given p. Moreover, as we have noted, any time that we have a representation of B n +i we 
also have one of the semi-direct product, which we identify with the subgroup Bi jT1 . 

The theorem shows that given a k parameter representation of the braid group, the 
construction yields a k + 1 parameter representation, apparently in a nontrivial way. For 
example, if one starts with the (zero parameter) trivial representation of F n xB n , the 
theorem produces the Burau representation, and starting with the Burau representation, 
the construction produces the Lawrence-Krammer representation |1(J2| . However the role 
of this extra parameter is not purely to add extra complication, it also adds extra structure. 
For there is a natural notion of what it should mean for a representation of a braid group 
to be unitary (See |125j . for example) and the results of Deligne-Mostow and Kohno imply: 



Theorem 17 jl()2j In the above notation, if p is unitary, then for generic values of s, so 
is pf. 




(20) 
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Open Problem 7 This problem is somewhat vague. It begins with a suggestion that 
Long's construction be studied in greater detail, and goes on to ask whether (a wild guess) 
all finite dimensional unitary matrix representations of B n arise in a manner which is related 
to the construction of Theorem \WV Jft 
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5 The word and conjugacy problems in the braid groups 



Let G be any finitely generated group. Fix a set of generators for G. A word w is a word in 
the given set of generators and their inverses. The element of G that it represents will be 
denoted [u], and its conjugacy class will be denoted {to}. We consider two problems. The 
word problem begins with u>,u' E G, and asks for an algorithm that will decide whether 
[uj] = [uj'] ? The conjugacy problem asks for an algorithm to decide whether {uj} = {uj'}, 
i.e. whether there exists a such that {uj 1 } = {oT l uia} ? A sharper version asks for a 
procedure for finding a, if it exists. In both cases we are interested in the complexity of 
the algorithm, and ask whether it is polynomial in either braid index n or word length 
| a; | or both? The word problem and the conjugacy are two of the three classical 'decision 
problems' first posed by Max Dehn [^J. In the case we consider here of G = B n , the set 
of generators we choose will be those of either the classical or the new presentation. Artin 
gave the first solution to the word problem for the braid group in 1925 0, and a number of 
fundamentally different solutions to the word problem in B n exist today, with at least two 
of them being polynomial in both n and \ui\. We focus here on an approach due to Frank 
Garside [Tjl] an d improved on by a number of others, and briefly describe other methods at 
the end of this section. 

On the other hand, we know of only one definitive solution to the conjugacy problem, 
namely the combinatorial solution that was discovered by Garside. He found a finite set of 
conjugates of an arbitrary element [uj] G B n , the 'Summit Set' of u, with the properties 
that if {uj} = {/?}, then S w = Sp, whereas if {uj} ^ {/?}, then S w n Sp = 0. If S w = S^, 
his methods also find a such that (3 = auia^ 1 . While his algorithm has been improved in 
major ways over the years, at this writing it is exponential in both n and \u\. The principle 
difficulties may be explained in the following way: 

(*) The entire set and a single element in must be computed to decide whether ui 
and (3 are or are not conjugate. While the calculation of a single element can now be 
done rapidly, in the general case has unpredictable size. The combinatorics that 
determine the size of are particularly subtle and difficult to understand, and at 
this writing only partial progress has been made, in spite of much effort by experts. 

The improvements that have been made over the years have included the replacement 
of by a proper subset S u , the 'Super Summit Set', via the work of EIRifai and Morton 
in |61| . However, while is very much smaller than S w , the principle difficulty (*) is 
unchanged. Very recently Gebhardt found a still smaller subset, the 'Ultra Summit Set 
fTu'j to replace [77]. The difficulty is unfortunately the same as it was for the summit 
set and the super summit set, but can be made more specific: the thing that one needs 
to understand is the number of 'closed cycling orbits' in U u and the length of each orbit, 
and how distinct orbits are related. Sang Jin Lee [211 has given examples to show that the 
number of orbits and their size can be arbitrarily large, and that it is in no way clear how 
distinct orbits are related, except in special cases. More work remains to be done. 

It was shown in Theorem ^ proved earlier in this article, that there is a faithful action 
of the braid group B n on the n-times punctured disc. Investigating this action (in the more 
general setting of the action of the mapping class group of a 2-manifold on the 2-manifold) , 
Thurston proposed in the early 1980's a very different approach to the conjugacy problem 
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which is based upon the dynamical properties of that action. These ideas were investigated 
in Lee Mosher's PhD thesis |112j . The curious fact is that while the dynamical picture 
quickly yields several very interesting class invariants not easily seen in the combinatorial 
picture, namely a graph (known as a 'train track' ) which is embedded in the surface and is 
invariant under the action of properly chosen representatives of {uj}, and a real number A, 
however they are not enough to determine conjugacy. Generic elements in the mapping class 
group of a surface have an invariant train track, but it is not unique. It is known that there 
are finitely many possible train tracks associated to a given element, but their enumeration 
remains an unsolved problem, and that is what gets in the way of a nice solution to the 
conjugacy problem. In fact, when one begins to understand the details, the entire picture 
suggests difficulties very much like those in (*) again. 

We have just one more remark of a general nature, before we proceed to review all these 
matters in more detail. One of the reasons that the Garside approach to the conjugacy 
problem is interesting is because the techniques that were developed for the braid groups 
revealed unexpected structures in B n that generalize to related unexpected structures in 
Artin groups (see H5.4j) . and in the less well-known class of 'Garside groups'. See § 15.21 
There has been major activity in recent years regarding these structures, and there are 
also many open problems. The same can also be said for the Thurston approach, as it too 
generalizes from an action of B n on the n-times punctured disc to actions of mapping class 
groups on curves on surfaces, discussed in H5.6I All of these matters, and other related ones, 
will be discussed below. 

5.1 The Garside approach, as improved over the years 

We assume, initially, that the group B„ is defined by the classical presentation (J2J), with gen- 
erators ax,... , o n -\. In this subsection we describe the solution to the word and conjugacy 
problem which was discovered by Garside in 1968 (see |76| ) and subsequently sharpened and 
expanded, in many ways, through the contributions of others, in particular Thurston |62j . 
EIRifai and Morton |61j . Birman, Ko and Lee (}25] and |24j). Franco and Gonzales- Meneses 
|7()j and most recently Gebhardt [Z7J. Other relevant papers are |82j and |8.3| . 

In (G1)-(G6) below we describe a constructive method for finding a normal form for 
words. In (G7)-(G10) we describe how to find a unique finite set of words in normal form 
that characterizes {uj}. 

(Gl) Elements of B n which can be represented by braid words which only involve positive 
powers of the <7j are called positive braids. A key fact which was observed by Garside 
in [7H] is that the presentation (J2J) not only defines the group B n , it also defines a 
monoid B+. He then went on to prove that this monoid embeds in the obvious natural 
way in B n , in the strong sense that two positive words in B n define the same element 
of B n if and only if they also define the same element in the monoid B+ . The monoid 
of positive braids is particularly useful in studying B n because the defining relations 
all preserve word length, so that the number of candidates for a positive word which 
represents a positive braid is finite. However, all this is of little use unless we can 
show that the monoid of positive braids is more than just a small and very special 
subset of B n . In (G3) below we show that this is indeed the case. 
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(G2) A special role is played by the Garside braid A in B n . It is a positive half-twist of all 
the braid strands, and is defined by the braid word 

A n = ((7 n _icr n _2 • • • cri)((T n _i<7 n _2 ■ ■ • er 2 ) • ■ ■ (cr n _ier n _ 2 )(o"n-l)- 
It is illustrated in sketch (i) of Figure l2"H for n = 5. The square of A n (a full twist of all 




(i) (ii) 



Figure 24: (i) The 5-braid A 5 . (ii) The 5-braid 5 5 . 

the braid strands) generates the infinite cyclic center of B n . The inner automorphism 
t : B n — > B n which is defined by t(X) = A _1 XA is the symmetry which sends each 
dj to cr n _j. Of course A itself is invariant under this symmetry. Later we will show 
that almost all of the combinatorics which we are describing hold equally well for the 
new presentation @. The role of A in the classical presentation is replaced by that 
of the braid 5 = er n _i jn ■ ■ ■ (J2,z<7i,2 which is depicted in Figure l2"lT ii). See ^15.31 below. 

(G3) The Garside braid A is very rich in elementary braid transformations. In particular, 
for each i = 1, . . . , n — 1 there exist (non- unique) positive braids Li and Ri such that 
A = LjCj = cijiij. This implies that a^ 1 = A _1 Lj = Ri A -1 for each i = 1, . . . , n — 1, 
so that an arbitrary word uj = erf* • • -erf^, e q = ±1, can be converted to a word 
which uses only positive braid generators, at the expense of inserting arbitrary powers 
of A -1 . Since, by (G2) we have (5A e = A e r(/3) for every braid (3, all powers of A -1 
can be moved to the left (or right). It follows that [u] is also defined by a word of the 
form A r T where T is positive. This representation is non-unique. 

(G4) To begin to find unique aspects, Garside observes that there is a maximum value, say 
i, of r with the property that the braid defined by uj is also represented by a word 
A l Z where Z is positive and i is maximal for all braids of this form. Note that this 
means that Z cannot be written in the form Z1AZ2 for any positive words Z\,Z%, 
because if it could, then it would be possible, by (G2), to rewrite A l Z as A 1+1 t(Zi)Z2, 
with t{Z\)Zi G B+, contradicting the maximality of i. The integer i is known as the 
infimum of [uj], and written inf(uj). It is an invariant of [uj]. Garside's complete 
invariant of [uj] is A 1 Zq, where among all positive words Zq, Z\, . . . , Z q that define 
the same element as Z, one chooses the unique word Zq whose subscript array (as a 
positive word in ai, . . . , cr n _i) is lexicographically minimal. This is Garside's solution 
to the word problem in B n . It is exponential in both n and \ui\ because there exist 
braids which admit very many of the elementary braid transformations in (|2"jl. To 
make this very explicit, we note that by |126j the number of positive words of letter 
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length (n)(n — l)/2 in the standard braid generators that represents A n is the same 
as the number of standard Young tableaux of shape (n, re — 1, . . . , 1). It is given by 
the hook length formula as {{n)(n - l)/2))!/(l ri - 1 3"- 1 5 n - 2 • • • (2n - 3)). Calculating, 
we learn that A5 can be represented by 768 positive words, all of letter length 10, in 
the a[s, giving a good idea of the problem. Moreover, this formula shows that the 
number of words increases exponentially with n. Of course the elements that are of 
interest to us, that is the positive braids defined by words like Z above, do not contain 
A, however there is not reason why they cannot contain, for example, arbitrarily high 
powers of large subwords of A. 

(G5) Let I and r be positive words such that there is a factorization of A in the form A = Ir. 
We call I a left divisor of A and r a right divisor of A. Let V (respectively V') be the 
collection of all left (resp. right) divisors of A. Note that we already used the fact 
that each of the generators U{ belongs to both V and V' . As it turns out, the sets V 
and V 1 coincide. The set V will be seen to play an essential role in Garside's solution 
to the conjugacy problem, but before we discuss that we describe how to use it to get 
a very fast solution to the word problem. The solution to the word problem was first 
discovered by Adyan [I] in 1984, but Adyan's work was not well-known in the West. 
It was not referenced by either Thurston (see Chapter 5 of 62 ) or EIRifai and Morton 
|6*T] . who rediscovered Adyan's work 8-10 years later, and added to it. Nevertheless, 
we use |61| and as our main source rather than because the point of view in 
those papers leads us more naturally to recent generalizations. 

(G6) A key observation is that there are re! braids in V, and that these braids are in 1-1 
correspondence with the permutations of their end-points, under the correspondence 
defined by sending each Ui to the transposition + 1). For this reason V is known 
as the set of permutation braids. This, combined with the fact that the crossings 
are always positive, makes it possible to reconstruct any permutation braid from its 
permutation, and so to obtain a unique element, even though its representation as 
a word is highly non-unique. Permutation braids have a key property: after the 
braid is tightened, any two strands cross at most once, positively. To understand the 
importance of this fact, we note that any positive 5-braid in which 2 strands cross at 
most once, whose associated permutation is (1,2,3,4,5) — > (5,4,3,2,1) must be A5, 
a criterion which is an enormous improvement over searching through the 768 distinct 
positive words that the hook length formula showed us represent A5, giving a good 
idea of the simplification that Adyan, Thurston and EIRifai and Morton discovered. 

The final step in finding a very rapid solution to the word problem in B n is a unique 
way to factorize the braid P in the partial normal form A l P of (G4) above as a unique 
product of finitely many permutation braids. If the strands in P cross at most once, 
then P £ V and we may choose any positive braid word, say l±, which has the same 
permutation as P as its representative. If not, set P = where l\ is a positive braid 
of maximal length in which two strands cross at most once and I* is the rest of P. 
If no two strands in I* cross more than once, set I2 = l*, and stop. If not, repeat, 
setting l\ = Z2Z2? where I2 has maximal length among all positive braids whose strands 
cross at most once, and so forth to obtain P = I1I2 ■ ■ ■ l s where each li £ V and each 
li has maximal length for all factorizations of 17 \ • • • 17 L as 1$* with li & V and I* 
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positive. This representation is unique, up to the choices of the words which represent 
li, . . . , l s . As we remarked earlier, each ^ is determined uniquely as an element in B+ 
by the permutation of its strands. 

The factorization of an arbitrary braid word as A l P = A'Hih ■ ■ ■ l s is the left greedy 
normal form, where the term 'left greedy' suggests the fact that each li is a maximal 
permutation subbraid relative to the positive braid to its right. The left greedy normal 
form solves the word problem. The integer i was already defined to be inf(u). The 
integer i + s is known as the supremum of u>, written sup(u>). The integer s is the 
canonical length L(lo). 

Remark 5.1 The Adyan-Thurston-ElRifai- Morton solution to the word problem is 
shown in [22] to be O(|o;| 2 nlogn), where \u>\ is the word length of the initial represen- 
tative of the braid w, as a word in the standard generators of B n . 

Example 5.1 We illustrate how to find the left greedy normal form for a positive 
braid via an example. Assume that we are given the positive braid 

The first step is to factor P as a product of permutation braids by working along the 
braid word from left to right and inserting the next partition whenever two strands are 
about to cross a second time since the beginning of the current partition. The reader 
may wish to draw a pictures to go with this example. This gives the factorization: 

( l 3 02 )( 02 cr i a 3 )( a 3 a 2 ct 3 )( cr 2 ) 

Next, start at the right end of the braid word, and ask whether the elementary braid 
relations can be used to move crossings from one partition to the partition on its 
immediate left, without forcing an adjacent pair of strands in a factor to cross. This 
process is repeated three times. The first time one applies the braid relations to factor 
3, and then to adjacent letters in factors 2 and 3, to increase the length of factor 2 at 
the expense of decreasing the length of factor 3: 

(010302) (02010302) (0302) (02-) 
In fact one can push one more crossing from factor 3 to factor 2: 

(010302) (0201020302) (02) (02), 
Now it is possible to move a crossing from factor 2 to factor 1, giving: 

(01030201) (02010302) (02) (02), 

No further changes are possible, we have achieved left greedy normal form. If it had 
happened that P contained A4, our factorization would reveal it because A4 G V . 
Each canonical factor represents a braid in V C B^~ and so has length strictly less 
than I A4I = 6. The words which represent the canonical factors are non-unique. Their 
associated permutations determine the canonical factors uniquely. 4* 
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(G7) We pass to the conjugacy problem, which builds on the solution just given to the word 
problem. From now on a word uj be always be assumed to be in left greedy normal 
form. Notice that the abelianizing map e : B ra — > Z has infinite cyclic image, and that 
e(uj) is the exponent sum of a representing word in the <7j. Clearly e(uS) is an invariant 
of both word and conjugacy class. It follows that there are only finitely many braids 
u' in the conjugacy class {uj} which have left greedy normal form with inf{u>') > 
inf(uj), because e(A) = (n)(n — l)/2 and any increase in inf(u>) must come at the 
expense of a corresponding decrease in word length of the positive word that remains 
after all powers of A have been pushed to the left. Let Inf(u) be the maximum value 
of inf(uj') for all braids uj' in the conjugacy class {uj}. Assume that a representative 
oj\ of {uj} is given and that it has left greedy normal form A 1 L1L2 ■ ■ ■ Lg, where 
I = Inf(uj) and S is minimal for all braids in normal form which are conjugate to 
uj. Let Sup(uj) be the integer I + S. Then Inf(uj) and Sup(uj) are class invariants of 
uj. The EIRifai-Morton super summit set is the collection of all elements in left- 
greedy normal form which realize Inf(uj) and Sup(u). It is a major improvement over 
Garside's summit set S(cj), which is the larger set of all elements which realize Inf(uj) 
but not Sup{uj), with left greedy form replaced by the subscript ordering described in 
(G4) above. 

One might wonder how the normal forms which we just described for words and 
conjugacy classes relate to length functions on B n . In this regard, R. Charney has 
introduced a concept of 'geodesic length' in [31]. As above, let V be the set of all 
permutation braids. Charney defines the geodesic length of a braid u to be the small- 
est integer K = K(uj) such that there is a word L^L^ 2 ■ ■ ■ L e £ , where each = ±1, 
which represents the conjugacy class {uj}, with each Li GV and each L~ l the inverse 
of a word in V . If A u L\Li ■ ■ ■ L s is an arbitrary element in the super summit set of 
uj, it is not difficult to show that the geodesic length of {uj} is the maximum of the 3 
integers (s + u, —u, s). We remark that Charney's geodesic length is defined in [13] for 
all Artin groups of finite type. This and other ways in which the Garside machinery 
generalizes to other classes of groups, including all Artin groups of finite type, will be 
discussed (much too briefly) in ^15.21 

Open Problem 8 In the manuscript [HE] Krammer gives several proofs of the faith- 
fulness of the Lawrence-Krammer representation of B n . One of his very interesting 
proofs shows that if [uj] € B n , then it is a very simple matter to read Charney's 
geodesic length from the matrix representation of uj. Since the unique element of 
geodesic length zero is the identity, it follows that the representation is faithful. Our 
suggestion for future work is to investigate the Garside solution to the conjugacy 
problem and its improvements (to be described below) via the Lawrence-Krammer 
representation of B n . We mention this because we feel that this aspect of Krammer 's 
work has received very little attention. <fc 

(G8) Here is a fast constructive procedure for finding Inf(uj) and Sup{u)), due to EIRifai 
and Morton [STJ and to Birman, Ko and Lee in |24| . Starting with any element in 
{to}, define the cycling of uj = A l /i?2 • • • h to be the braid c(uj) = t~ 1 (1\)ujt(Ii) and 
the decycling of uj to be the braid d(u>) = /^^(/fc) -1 . Note that c(lu) and d(u>) G {uj}. 
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Putting c(lo) and d(u) into left greedy normal form, one obtains braids which have at 
least i powers of A, and possibly more because it can happen that after cycling the 
braid A 1 /^ • • • h wm change to one whose left greedy normal form is A 1 Ifa ■ ■ ■ Its' 
with i' > i. This would, of course, reduce s if it increases i. In 61 EIRifai and 
Morton proved that if inf{uj) is not maximal for the conjugacy class, then it can be 
increased by repeated cycling. Also, if sup(uj) is not minimal, then it can be decreased 
by decycling. 

They could not say, however, how many times one might have to cycle or decycle 
before being sure that no further improvement was possible. The solution to that 
problem was found by Birman, Ko and Lee in [21]. It was shown that if inf{u>) is not 
maximal for the conjugacy class if it will be increased after fewer than (n)(n — l)/2 
cyclings, and similarly if sup(to) is not minimal it will be decreased after fewer than 
(n — l)(n)/2 decyclings. One then has a tool for increasing inf(uj) and decreasing 
sup(u), and also a test which tells, definitively, when no further increase or decrease 
is possible. 

Remark 5.2 The fact that everything we do to compute Inf(w) also applies to the 
computation of Sup(iij) is not surprising because EIRifai and Morton showed that 
Sup(u>) = -Inf(w _1 ). 

(G9) We now come to Gebhardt's very new work. Following the steps given in (G8) above, 
one will have on hand the summit set S(oj), i.e. the set of all braids in the conjugacy 
class {uj} which have left greedy normal form A 1 L1L2 ■ ■ ■ L$, where /= Inf(uj) is 
maximal for the class and I + S = Sup(uj) is minimal for the class. This set is 
still very big. Gebhardt's improvement is to show that it suffices to consider only the 
subset of braids which in a closed orbit under cycling. This finite set of words is called 
the ultra summit set U^. It has been proved by Volker Gebhardt [7?! that {uj} = {uj 1 } 
if and only if TJ W = U u '- Note that Gebhardt does not need to use decycling, he proves 
that it suffices to consider the closed orbits under cycling. 

(G10) To compute U u , Gebhardt also shows that if Ui,ujj £ U^, then there is a finite chain 
uji = u>i t i —* u>i t 2 Ui t q = Uj of braids, with each U{j in such that each 

is obtained from Uit—i by conjugating by a single element in V ■ Thus the following 
steps suffice to compute U u , after one knows a single element p £ U u : One first 
computes the conjugates of p by the n! elements in V . One then puts each into left 
greedy normal form, and discards any braid that either does not (a) realize Inf and 
Sup, or (b) realizes Inf and Sup but is not in a closed orbit under cycling, or (c) 
realizes Inf and Sup and is in a closed orbit under cycling but is not a new element 
in Uu. Ultimately, the list of elements so obtained closes to give U w . 

Note that in doing this computation, one not only learns, for each Ui,ujj G U u , that 
{iOj} = {uij}, but one also computes an explicit element a such that u)j = a~ l uiia. 

An inefficient part of this computation is the constant need to access the n! — 2 non- 
trivial elements in V. A more efficient process is known (see (Gil) below), but to 
describe it, we need new notions, which will be introduced after we discuss a wider 
class of groups, known as Gar side groups. 
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5.2 Generalizations: from B n to Garside groups 

The first person to realize that the structure described in (Gl)-(GIO) is not restricted to 
braids was Garside himself |7B], but his generalizations were limited to examples. Soon 
after his paper was published, the ideas were shown to go through, appropriately modified, 
in all finite type Artin groups, i.e., Artin groups whose associated Coxeter group is finite, 
by Breiskorn and Saito jlU] and by Deligne (SB]. See |1U6| for a survey (and reworking) 
of the results first proved in jlU] and |56| . (See ^5.41 for definitions of Artin and Coxeter 
groups in general.) They used explicit properties of finite reflection groups in their proof, 
but Paris and Dehornoy were thinking more generally and defined a broader class which 
they called 'Garside groups'. The class included all finite type Artin groups. Over the 
last few years, several tentative definitions of the term 'Garside group', referring to various 
classes of groups that generalize the braid group in this way, were proposed and appear in 
the literature (see (SH], . for example) before the one that we give below was agreed 
upon by many, although the search for the most general class of such groups continues and 
the definition of 'Garside groups' is likely to continue to be in flux for some time. As we 
proceed through the definitions of 'Garside monoids', 'Garside structures', and 'Garside 
groups', the reader can look to the classical presentation of B ra for examples. 

Given a finitely generated monoid G + with identity e, we can define a partial orders on 
its elements. Let a,b £ G + . We say that a -< b if a is a left divisor of b, i.e. there exists 
c G M, c / e, with ac = b. Also b y a if there exists c such that b = ca. Caution: the 
two orderings are really different, that is, a -< b does not imply that by a. An interesting 
example in the positive braid monoid B+ is the partial order induced on the elements in 
the sets V of left divisors of A n , which gives it the structure of a lattice. The reader who 
wishes to get a feeling for the ordering might wish to construct the lattice in the cases A3 
and A4. The lattice for A n has n\ elements. 

Now given a,b £ G + we can define in a natural way the (left) greatest common divisor 
of a and b, if it exists, written d = a A 6, as follows: d -< a,d -< b and if, for any x, it happens 
that x -< a and x -< b then x -< d. Similarly, we define the (left) least common multiple of 
a, b, denoted ra = aV b, if a ~< m, b -< m and if for any x it happens that a -< x and b -< x 
then m -< x. 

It turns out that in the case of the braid monoid B+ the left partial ordering extends 
to a right-invariant ordering on the full braid group B n , a matter which we will discuss in 
ED below. 

We continue with our description of the features of the monoid B+ C B n which will lead 
us to define more general monoids G + and their associated groups G. An element x G G + 
is an atom if x 7^ e and if x has no proper left or right divisors. For example, the generators 
a%, . . . ,£r n _i are the atoms in the monoid G + . Note that by (G3) above the atoms in B+ 
are left and right divisors of A, also they generate B+. 

While we have not had occasion to introduce a key property of the Garside braid A 
before this, we do so now: As noted earlier, its sets V and V' of left and right divisors of A 
coincide. This property was used in the proofs of the facts that we described in (Gl)-(GIO) 
above, and is part of a long story about symmetries in the braid group. 

A monoid G + is a Garside monoid if: 

1. G + is generated by its atoms, 
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2. For every x G G + there exists an integer l(x) > such that x cannot be written as a 
product of more than l[x) atoms, 

3. G + is left and right cancellative, and every pair of elements in G + admits a left and 
also a right least common multiple and greatest common divisor, 

4. There is an element A G G + , the Garside element , whose left divisors and right 
divisors coincide, also they form a lattice, also each generates G + . 

All of the data just listed is called a Garside structure . It follows (via the work of Ore, 
described in Volume 1 of the book [If)]) that every Garside monoid embeds in its group 
of fractions, which is defined to be a Garside group. It is not difficult to prove that the 
Garside element in a Garside monoid is the least common multiple of its set of atoms. An 
interesting property is that a Garside monoid admits a presentation {S \ R), where for every 
pair of generators x, y there is a relation of the form x ■ ■ ■ = y ■ ■ ■ that prescribes how to 
complete x and y on the right in order to obtain equal elements. 

Examples abound. While we shall see that the braid groups are torsion-free, there are 
examples of Garside groups which have torsion. Whereas the relations in the braid monoid 
all preserve word length (which results in various finiteness aspects of the algorithms that 
we described) there are examples of Garside monoids in which this is not the case. As noted 
above, every Artin group of finite type is a Garside group. Torus knot groups are Garside 
groups, as are fundamental groups of complements of complex lines through the origin |55j . 
See |121j for additional examples. 

Our reason for introducing Garside groups is that the solutions to the word problem 
and conjugacy problem in B n described in ^5.11 suitably modified, generalize to the class 
of Garside groups (jHSl) |121j ). as do the simplifications of (G10) which we now describe. 
These improvements were first discovered by Franco and Gonzales-Meneses [23 , and later 
improved by Gebhardt [77] . 

We return to the braid group, with the partial ordering of B n on hand: 

(Gil) We already learned that if we begin with an arbitrary braid lo, then after a bounded 
number of cyclings and decyclings we will obtain a braid in {lo} which realizes Inf{uj) 
and Sup(uj). Continuing to cycle, we will arrive at an element, say p, in XJ W . The re- 
maining task is the computation of the full set U u , and the method described in (G10) 
is inefficient. The difficulty is that, starting with p G U u one is forced to compute its 
conjugates by all the n! elements in V, even though many of those will turn out to 
either not realize Inf(u>) and/or Sup(uj), and so will be discarded, whereas others will 
turn out to be duplicates of ones already computed. Moreover, this inefficient step is 
done repeatedly. 

The good news is that, following ideas first introduced by N. Franco and J. Gonzalez- 
Meneses in |7fl| . Gebhardt proves in Theorem 1.17 of [?_7J, that if p G U u , and if 
a,b G V, with a~ 1 pa and b~ 1 pb G U u , then c~ 1 pc G U u , where c = a A b. See [7J3], and 
then [77], for a systematic procedure that allows one to use this fact to find all the 
orbits in XJ W efficiently. In this regard we remark that the work of Gebhardt is very 
new. The major open problem that remains is to improve it to an algorithm which 
will be polynomial in the word length of lo: 
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Open Problem 9 The bad news is that, like the Garside's summit set and EIRifai and 
Morton's super summit set, Gebhardt's ultra summit set also has the key difficulty (★). 
More work remains to be done. 4> 

5.3 The new presentation and multiple Garside structures 

As it turns out, essentially all of the structure that we just described also exists with respect 
to the second presentation of B n which was given in ^ of this paper. This is the main 
result of pij . The fundamental braid A n is replaced by 5 n = cr n -\,n^n-2,n-\ ■ ■ ■ 0"2,30"l,2- 
It is proved in [23] that the associated monoid B+ embeds in B„, and that all the results 
described in (Gl)-(Gll) above have curious (and very surprising) variations which hold in 
the new situation. 

For example, the elements in the set of left divisors of 5 n are in 1-1 correspondence with a 
set of permutations, namely permutations which are products of non-interlacing descending 
cycles. The set of left divisors of 5 n turns out to have order equal to the n th Catalan number, 
(2n\)/(n\(n + 1)!), whereas the left divisors of A have order n\. The fact that the order 
of the set of permutation braids is smaller in the new presentation than in the classical 
presentation had led to the hope by the authors of [21], when they first discovered the new 
presentation, that it would result in a polynomial algorithm for the conjugacy problem, 
but the Catalan numbers grow exponentially with index, and once again (*) proved to be 
a fundamental obstacle. Thus, while the new presentation is extremely interesting in its 
own right, and does lead to faster word and conjugacy algorithms, the improvement in that 
regard does not address the fundamental underlying difficulties. 

Open Problem 10 Curiously, it appears very likely that the classical and new presenta- 
tions of B n are the only positive presentations of B n in which the Garside structure exists, 
although that has not been proved at this time, and is an interesting open problem. For 
partial results in this direction, see [HI]. 

There is a different aspect of the dual presentations which we mention now, which 
involves a small detour. Before we can explain it, recall that one of our earliest definitions 
of the braid group P n , given in Hl.l\ was as the fundamental group tt\ (Cofi , p) , of the space 
formed from C n by deleting the hyperplanes along which two or more complex coordinates 
coincide. Of course this gives a natural cell decomposition for Co ; n as a union of (open) cells 
of real dimension In. The braid group B n , as we defined it in Hl.ll is the fundamental group 
of the quotient Co, n = Co.n/^n, where the symmetric group S ra acts on Co,n by permuting 
coordinates. In the interesting manuscript |69| . Fox and Neuwirth used this natural cell 
decomposition of Co,n to find a presentation for B n , arriving at the classical presentation 
(J2J for the braid group B n . See Section C of Chapter 10 of [33] for a succinct presentation 
of the results in (HHl- Fox and Neuwirth also prove that Co,n is aspherical, and use this to 
give the first proof that B n is torsion-free. See ^6.41 below for a different and very easily 
understood proof of that same fact. 

Around the same time that the first author, together with Ko and Lee, wrote the paper 
|23| . which introduced the dual presentation of B n , Thomas Brady was thinking about 
other complexes which, like the one just described might serve as a K (n, 1) for the group 
B n . See which describes the construction of a finite CW-complex K n of dimension 
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n—1 which is homotopy equivalent to Cq )TL , and which is defined combinatorially, using the 
partial ordering on £ n , as described in ^5.21 in the discussion of the lattice of simple words 
in B n . Brady then used his complex to determine a presentation for B n , arriving at the 
dual presentation |J5J). 

As it turned out, the new presentation was important for other reasons too. The braid 
groups inherit a second Garside structure from the new presentation ©. The fact that 
the same was true for many other Garside groups played a major role in the discovery of 
the appropriate definitions. The Garside structure on the braid group arising from the new 
presentation ends up revealing even more structure: it is dual to that coming from the 
standard presentation, in the sense of an action on a complex and a dual complex. As it 
turned out, all finite type Artin groups of finite type also have dual 'Garside structures, as 
proved by Bessis (this reference also contains the details of this dual structure). See 
also [37]. We also refer the reader to |121| for explicit presentations for the dual monoids 
associated to the Artin groups of finite type, and also for an interesting table that gives 
the number of simple elements defined by the left divisors of the Garside element in the 
classical and dual monoids for the finite type Artin groups. 

Open Problem 11 This one is a very big set of problems. It is not known whether all 
Garside groups have dual presentations, in fact, it is also not known how many distinct 
Garside structures a given Garside group may have. £ 

5.4 Artin monoids and their groups 

There is another class of groups which is intimately related to the braid group and its 
associated monoid, but it is much less well understood than the Garside groups. Let S = 
{u, ....... ,t, ... ,v} be a finite set. A Coxeter graph T over S is a graph whose vertices are 

in 1-1 correspondence with the elements of S. There are no edges joining a vertex s to itself. 
There may or may not be an edge joining a vertex s to a vertex t ^ s. Each pair of vertices 
(s,i) = (i, s) is labeled by a non-negative integer m(s,t). There are 2 types of labels: The 
label m(s, t) is 2 if T does not have an edge that joins s and t, and it is € {3, 4, ... , 00} if 
there is an edge joining s and t. 

The Artin group ^4(r) associated to T has generators {a s s £ S}. There is a relation for 
each label m(s, t) < 00, namely o s oto s ot ■ ■ ■ = atcr s atcr s • • • , where there are m(s, t) terms 
on each side, 2 < m(s, t) < 00. The Coxeter group C(F) associated to the Artin group ^4(T) 
is obtained by adding the relations = 1 for every s £ S. As previously mentioned, we 
say that the group ^4(r) has finite type if its associated Coxeter group is finite. The braid 
group B n is an example of an Artin group of finite type; its associated Coxeter group being 
the symmetric group T, n . Note that by definition the defining relations in the presentations 
that we just described for Artin groups all preserve word length. 

One of the many interesting properties of Artin groups is that for every Artin group 
there is an associated monoid A + , and just as every Garside monoid embeds in its group, 
it was proved by Paris in |117j that every Artin monoid injects in its group. The injectivity 
property holds in the following strong sense too: If two positive words P, P' represent the 
same element of A, then they represent the same element of A + . Paris's proof is completely 
different from Garside's proof of injectivity in the case of the braid group, which is the basis 
for the known proofs of the same fact for Garside groups. On the other hand, the other 
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properties that are needed to obtain a Garside structure may or may not hold, for example 
it is definitely not true that every element in an Artin group can be written in the form 
NP, where N is negative and P is positive. 

Remark 5.3 An interesting special case of Artin groups are the right-angled Artin groups. 
They are Artin groups in which the defining relations are all of the form o s Ot = crta s . 
For example, the right-angled Artin group associated to the braid group is defined by the 
presentation: 



Open Problem 12 In H4.5I we explained the fairly recent proof that the braid groups 
are linear. This leads one to ask, immediately, whether the same is true for its natural 
generalizations, e.g., Garside groups and Artin groups. It turns out that, like the braid 
groups, all right-angled Artin groups have faithful matrix representations [HE]) (EH!- Of 
course any Artin group that injects as a subgroup of a related braid group is also linear, 
e.g. Artin groups of type B n . It was proved recently by A. Cohen and D. Wales and 
simultaneously by F. Digne [HB] that all finite type Artin groups are linear. Going beyond 
that, the matter seems to be wide open and interesting. & 

Remark 5.4 For reasons of space, we have not included any significant discussion of the 
vast literature on Artin groups and associated complexes on which they act. It is a pity to 
omit it, because it is a major area, and much of it had its origins in work on B n . 

5.5 Braid groups and public key cryptography 

The problem which is the focus of of 'public key cryptography' was mentioned, very briefly, 
in t jl.4.51 The basic issue is how to send information, securely, over an insecure channel. 
The solution is always to use some sort of code whose main features are known to the 
sender and recipient, but which cannot be deduced by a viewer who lacks knowledge of the 
shared keys. To the best of our knowledge, all solutions to this problem rest on the same 
underlying idea: they make use of problems which have a precise answer, which is known 
to both the sender and recipient, but one which is deemed to be so difficult to compute 
that it is, in effect, unavailable to a viewer, even though the viewer has all the necessary 
data to deduce it. The earliest such schemes were based upon the difficulty of factorizing 
large integers into a product of primes. The individuals who wish to exchange data over a 
public system are denoted A and B. In a vastly oversimplified version, Both A and B have 
agreed, privately, on the choice of a prime number p. The sender chooses another prime q 
and transmits the product n = pq. A viewer may learn n, but because of the difficulty of 
factorizing n into primes cannot deduce p and q. The recipient, who knows both p and n, 
has no problem computing q. 

A more recent approach is due to W. Diffie and M. Hellman |^Zj. As before, both A 
and B have agreed, privately, on the choice of a prime number p and a generator g of 
the finite cyclic group Z/pZ. A chooses a number a at random and computes g a (mod p), 
which she sends to B on the public channel. As for B he chooses a number b at random, 
computes <? b ((mod p) and sends in to A on the public channel. Since A now knows both g b 
and a she can compute (g a ) b = g ab (mod) p. Similarly, since B now knows both g a and b, 




(21) 
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he can compute (g b ) a = g ba (mod p) = g ab (mod p). So both know g ab (mod p). As for the 
viewer, he knows both g a (mod p) and (^(mod p), however, because of the known difficulty 
of computing discrete logs, the crucial information <7 a6 (mod p) is in effect unavailable to the 
viewer. 

A rather different set of ideas was proposed in [3] and |92| , and this is where the braid 
group comes in. The security of a system that is based upon braid groups relies upon 
the assumption that the word and conjugacy problems in the braid group have both been 
solved, but the conjugacy problem is computationally intractable whereas the word problem 
is not. However, as we have seen, that matter seems to be wide open at this moment. We 
have described the underlying mathematics behind the EIRifai-Morton solution to the word 
problem, and the best of the current solutions to the word and conjugacy problem, namely 
that of Gebhardt. The solution to the word problem is used in the conjugacy problem. The 
reason that the conjugacy problem is not polynomial is that we do not understand enough 
about the structure of the summit set, the super summit set and the ultra summit set. 
Our strong belief is that these matters will be settled. The assumption that our current 
lack of understanding of aspects of the mathematics of braids means that they cannot be 
understood seems unwarranted. 

5.6 The Nielsen-Thurston approach to the conjugacy problem in B n 

In this subsection we consider the conjugacy problem in B n from a new point of view. We 
return again to the interpretation given in Theorem I of <jl .31 of the braid group B n as 
the mapping class group A^o,l,n of the punctured disc £o,i,n = D^. In this context, then, 
we emphasize that the term braid refers to a mapping class, that is, an isotopy class of 
diff eomor phisms . 

The Nielsen-Thurston classification of mapping classes of a surface is probably the single 
most important advance in this theory in the last century, and we review it here. For 
simplicity, we shall focus for now on the mapping class group of a closed surface M g = 
■Mgfifl in the notation of We note that the groups M. g have trivial center for all g > 3. 

We make the tentative definition that an element of A4 g is 'reducible' if it preserves up 
to isotopy a family of disjoint nontrivial curves on the punctured disc. Such a family of 
curves is known as a reduction system. (Throughout this discussion, we use 'curve' to refer 
to the isotopy class of a curve. By 'disjoint curves', we mean two distinct isotopy classes 
of curves with respective representatives which are disjoint.) It was proved in that 
for a reducible map cf>, there exists a essential reduction system, denoted ERS(0). A curve 
c G ERSO) if and only if 

1. There exists an integer k such that (f) k (c) = c. 

2. If a curve x has nonzero geometric intersection with c, then (f> m (x) ^ x for all integers 
m. In particular, any curve x which has nonzero geometric intersection with a curve 
ce ERS(0) is not in ERS (</>). 

It is proved in |2fi| that the curves in ERS(^>) are contained in every reducing system for 
<j), and are a minimal reduction system for <p. Also, the system of curves ERS(</>) is unique. 
Keeping all this in mind, we now define an element G M g to be reducible if it fixes the 
curves in an essential reduction system, i.e. in ERS ((/>). 
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A mapping class (f> S Ai g is periodic if <fi k = 1 in M g . 

We note that a reducible mapping class always contains a representative diffeomorphism 
which fixes a given reduction system setwise. Likewise, Nielsen showed that a periodic map- 
ping class always contains a representative which is periodic as a diffeomorphism. Finally, 
a mapping class which is neither reducible nor periodic is pseudo-Anosov (abbreviated as 
PA). 

Braid groups, as the mapping class group of a punctured disc, admit a similar classifi- 
cation. The group B n is torsion-free, but by analogy with the above case of mapping class 
groups of closed surfaces of genus at least two, which are centerless, we say that an element 
u> £ B ra is periodic if for some integer k, uj k is isotopic to a full Dehn twist on the boundary 
of the disc Z) 2 . In terms of generators and relations this is equivalent to saying that the 
braid is a root of A 2 , where A is the Garside braid. (Recall from H5.ll that the square of 
A n generates the center of B n .) 

The definitions of reducible braids and pseudo-Anosov (PA) braids require no alteration 
other than replacing the surface S g with the punctured disc. 

Remark 5.5 It is clear for B ra and A4 g alike, we also have a classification of elements as 
reducible, periodic and pseudo-Anosov up to conjugacy. 

Each of these three possibilities reveals new structure, so we consider them one at a 
time. A major reference for us is the paper j^j, which gives an algorithm for finding a 
system of reducing curves if they exist, and for recognizing periodic braids. This paper did 
not receive much attention at the time that it was written because it had the misfortune 
to be written simultaneously and independently with the ground-breaking and much more 
general papers of Bestvina and Handel ^2]. However, as is often the case, one learns very 
different things by examining a particular case of a phenomenon in detail, and by proving 
a broad generalization of the same phenomenon, and that is what happened here. 

Periodic braids. The first author learned from [B] that the classification of periodic 
braids had been solved via the work of Kerekjarto (1919) and Eilenberg (1935), who proved 
that up to conjugacy a periodic braid is a power of either 5 or a, where 6 = o\02 ■ ■ ■ o n -\ 
and a. = o\<Ji ■ • • cr n-l cr l- One verifies, using the elementary braid relations in the classical 
presentation, that 5 n = a"" 1 = A 2 . To visualize the assertion for 5 geometrically, place 
the punctures at polar angles 2^k/n around a circle of radius r < 1, and think of 5 as 
a rotation of D 2 of angle 2ir/n. To visualize a, do the same, only now place one of the 
punctures at the origin and arrange the remaining n — 1 punctures symmetrically at polar 
angles 2 / nk/{n — 1). 

Reducible braids. Let u = A 1 L1L2 ■ ■ ■ L s be a reducible mapping class in left greedy 
normal form, as described in fl5.ll Our initial model for Z) 2 will be the unit disc, with the 
n punctures arranged along the real axis, and placed symmetrically so that they divide the 
interval [—1,1] Cl into n + 1 equal line segments. It is shown in 8 that one can choose a 
braid u' conjugate to lo and in the super summit set of u which fixes the simplest possible 
family of closed curves, namely a family C of geometric ellipses whose centers are on a 
horizonal 'axis' through the punctures, and which are chosen in such a way that the axis 
bisects the discs that the curves in CRS(u/) bound. 
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Open Problem 13 Is it true that, if a reducible braid is in its ultra summit set, then its 
invariant multicurves are can always be chosen to be geometric ellipses? £ 

The remaining braiding may then be thought of as going on inside tubes, also the tubes 
may braid with other tubes. With this model it should be intuitively clear that the choice 
of such a representative for the conjugacy class of a braid which permutes the tubes in the 
required fashion may require very different choices (again up to conjugacy) for the braids 
which are inside the tubes, and that the sensible way to approach the problem is to cut the 
initial disc open along the reducing curves and focus on the periodic or PA maps inside the 
tubes. This is, of course, a very different approach from the one that was considered in ^5.11 
above, where no such considerations entered the picture. 

Pseudo-Anosov braids. In the PA case (which is the generic case), there is additional 
structure, and now our description becomes very incomplete. In this case |128j there ex- 
ist two projective measured foliations T u and T & which are preserved by an appropriate 
representative w of {w}. Moreover the action of w on T u (the unstable foliation) scales 
its measure by a real number A > 1, whereas the action on J- s (the stable foliation) scales 
its measure by 1/A. These two foliations and the scaling factor A are uniquely determined 
by the conjugacy class of {u}, however the triplet (.F 11 , .F s , A) does not determine {uj}. To 
explain the missing pieces, we replace the invariant foliations by an invariant 'train track', 
with weights associated to the various branches. 

In the case of pseudo-Anosov mapping classes acting on a once-punctured surface, a 
method for enumerating the train tracks is in Lee Mosher's unpublished PhD thesis |112j . 
But Mosher's work was incomplete and remained unpublished for many years, even as many 
of the ideas in it were developed and even expanded, leading to some confusions in the 
literature about exactly what is known and what remains open. At this writing a complete 
solution to the conjugacy problem for braid groups or more generally surface mapping class 
groups, based upon the Nielsen-Thurston machinery, does not exist in the literature. Mosher 
has a partially completed monograph in preparation, Train track expansions of measured 
foliations, which promises to give such a solution, but for more general surfaces, i.e. for 
Sg,b,n very little seems to be known. We therefore pose it as an open problem: 

Open Problem 14 Investigate the conjugacy problem in the mapping class groups A4 9; b,n 
with the goals of (i) pinning down precisely what is known for various triplets (g,b,n); (ii) 
describing all cases in which there is a complete solution; and (hi) describing what remains 
to be done in the simplest cases, that is A4 gi o,o and A4 fll Q,i. ♦ 

Essentially nothing is known, at this writing, about the interface between the dynamic 
and combinatorial solutions to the conjugacy problem in the braid groups, and still less 
in the general case of the more general mapping class groups Ai g fi,n, to which the entire 
Thurston machinery applies. We know of nothing which even hints at related dynamic 
structures in Artin groups or Garside groups. 

Open Problem 15 Does the Garside approach to the conjugacy problem in A4o,i,n gen- 
eralize to a related approach to the problem in Ai g .i,o for any g > 0? J(t 

Open Problem 16 Does the Nielsen-Thurston approach to the conjugacy problem in 
•M.o,i,n generalize to a related approach to the problem in any other Artin group? ft 
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5.7 Other solutions to the word problem 

We review, very briefly, other ways that the word and conjugacy problems have been solved 
in the braid groups. We restrict ourselves to results which revealed aspects of the structure 
of B n that has had major implications for our understanding of braid groups, even when 
the implications for the word and/or conjugacy problems fall short of that criterion. 

1. Artin's solution to the word problem. The earliest solution to the word problem 
was discovered in 1925 by Artin, in his first paper on braids [SJ. It was based upon his anal- 
ysis of the structure of the pure braid group, which we described in fll.21 and in particular 
in the defining relations |I] and the sequence given just below it: 

{1} - F„_i -^P n 1 P n _i - {1}. 

The resulting normal form, described in jSj as combing a braid, is well known. We refer the 
reader to Artin's original paper for a very beautiful example. In spite of much effort over 
the years nobody has managed, to this day, to use related techniques to solve the conjugacy 
problem in B n , except in very special cases. 

2. The Lawrence-Krammer representation. See ^1 above. When a group admits 
a faithful matrix representation, there exists a fast way to solve the word problem. It 
is interesting to note that Krammer's first proof of linearity, in the case of B4, used the 
solution to the word problem which came from the presentation © in ^ to this paper. See 
|97j . Also, when he proved linearity in the general case, in j^H], he gives two proofs. The 
second proof, Theorem 6.1 of [UHJ , shows that the Lawrence-Krammer matrices detect the 
infimum and supremum of a braid. It follows as an immediate corollary (noted in 98 ) that 
the representation is faithful, because any braid which has non-zero infimum and supremum 
cannot be the identity braid. 

As regards the conjugacy problem, there is a difficulty. The image of B„ under the 
isomorphism given by the Lawrence-Krammer representation yields a group B n which is a 
subgroup of the general linear group GLmiZlq* 1 ,^ 1 ]), where m = (n)(n - l)/2. To the 
best of our knowledge it is unknown at this time how to describe B n in any way that allows 
one to test membership in B n . Lacking such a test, any class invariants which one finds in 
this way will be limited in usefulness, because if ui E B n , there will be no way to distinguish 
between class invariants which arise from conjugation by elements in B n from those which 
arise from conjugation by more general elements of GL m {'L[q^ 1 , t^ 1 ]) from invariants thus 
one cannot hope for a complete solution. 

Open Problem 17 Investigate the eigenvalues and the trace of the Lawrence-Krammer 
matrices. £ 

3. The Dehornoy ordering. A group or a monoid has a right-invariant (resp. left - 
invariant ) ordering if there exists a strict linear ordering of its elements, denoted <, with 
the property that if f,g,h £ G, then f < g implies fh < gh (resp. hf < hg). To the 
best of our knowledge nobody had considered the question of whether B n had this property 
before 1982, when Patrick Dehornoy announced his discovery that the groups B n admit 
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such an ordering, and that it can be chosen to be either right-invariant or left-invariant, 
but not both. In the 5-author paper jST] the Dehornoy ordering was shown to have the 
following topological meaning. We restrict to the the right-invariant case, the two results 
being essentially identical. We regard B n as A^o,l,n> where the surface <Sb,i,n is the unit 
disc in the complex plane and the punctures lie on the real axis in the interval (—1,1). The 
punctures divide [—1, 1] Cl into n + 1 segments which we label £i,£2> . . . , £ n +l i n order, 
where £i joins {—1} to the first puncture. Choose [uji], [u^] € B n , and representatives u)\,u)2 
with the property that k>i([— 1, 1]) and u>([— 1, 1]) intersect minimally. Note that Wfc([— 1, 1]) 
divides Sq 1 n into two halves, and it makes sense to talk about the upper and lower half of 
u>2([— 1, 1]) because {—1} and {1} are on <9So,i,n and | <9So,i,n is the identity map. Then 
[0)2] > [wi] if wi(^i) = u>2(£i) for i = 1, . . . , j — 1 and an initial segment of tt>2(£j) lies in the 
upper component of 5o,l,n — oji ( [ — 1 , 1] ) . 

It was proved in |131j that the resulting left-invariant ordering of B n extends the ElRifai- 
Morton left partial ordering that we described in ^ 15.21 above. It is proved in [21] that there 
are infinitely many other left orderings. 

The subject blossomed after the paper (HZj appeared. In fact so much work resulted 
that there is now a 4-author monograph on the subject [31], written by some of those who 
were the main contributors, containing an excellent review of what has been learned during 
the 10 years since the initial discovery. We do not wish to repeat what is readily available 
elsewhere, especially because we are not experts, so we point the reader to Chapter 3 of 
|54| . where it is shown that the Dehornoy ordering leads to a solution to the word problem. 
Unfortunately, the solution so-obtained is exponential in whereas the solution described 
in (G1)-(G6) above is quadratic in \u>\. 

4. B n as a subgroup of Aut (F n ). It is well known that B n has a faithful representation 
as a subgroup of Aut(F n ), for example see |18| for a proof. This of course gives a solution 
to the word problem, but (like the Dehornoy solution) it is exponential in \uj\. On the other 
hand, it is extremely interesting that the Nielsen-Thurston machinery, described in H5.6I 
above, generalizes to the entire group Aut(F n ), giving yet another instance where the braid 
group is at the intersection of two rather different parts of mathematics, and could be said 
to have pointed the way to structure in the second based upon known structure in the first. 

In this section we have described several solutions to the word problem, and noted that 
one of them (the modified Garside approach of (G1)-(G6)) is O(\uo\ 2 (n)(log(n)), where \u\ 
is the letter length of two arbitrary representatives of elements of B n , using the classical 
presentation (j2J) for B n . The word problem seems to be one short step away from the 
non-minimal braid problem: given a word lo in the generators a%, ... ,cr n _i and their in- 
verses, determine whether there is a shorter word to' in the same generators which represents 
the same element of B n ? For, it is clear that a decision process exists: list all the words that 
are shorter than the given one, thin the list by eliminating as many candidates as possible 
by simple criteria such as preserving length mod 2, an obvious invariant, and then test, one 
by one, whether the survivors represent the same element of B n as the given word ul It 
therefore seemed totally surprising to us that in 1991 M. S. Patterson and A. A. Razborov 
proved: 

Theorem 18 |119j : The non-minimal braid problem is NP complete. 
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Thus, if one could find an algorithm to decide whether a given word u is non-minimal, and 
if the algorithm was polynomial in |w|, one would have proved that P = NP\ To the best 
of our knowledge, essentially nothing has been done on this problem. In this regard we 
suggest two research problems: 

Open Problem 18 The proof that is given in |119j is very specific to the classical pre- 
sentation © for B n . Can it be adapted to the new presentation? To other presentations? 
* 

Open Problem 19 Investigate the shortest word problem in the braid group B n , using 
the classical presentation. A 

In £13.11 we discussed and proved the Markov Theorem Without Stabilization in the 
special case of the unknot. See TheoremE3 which asserts that there is a complexity measure 
on closed braid representatives of the unknot, and using it a sequence of strictly complexity- 
reducing destabilizations and exchange moves that reduce a closed braid diagram for the 
unknot to a round planar circle. In [22], theorem \5\ was used to develop an algorithm for 
unknot recognition. In [221 the first steps were taken to develop a computer program to 
realize the resulting algorithm. The algorithm is far from being practical in even simple 
cases, however we are now in a position to describe three open problems, all closely related 
to (Gl)-(G-ll) above, which would lead to an effective solution to the unknot recognition 
problem: 

Open Problem 20 Develop an algorithm that will detect when the conjugacy class of a 
closed braid admits a destabilization. A 

An n- braid W admits an exchange move if it is conjugate to a braid of the form 
Ua n -iV(T~_i, where U and V depend only on a\, . . . , a n -2- Thus, up to conjugation, 
W is a product of two reducible braids, one positively reducible and the other negatively 
reducible. By Theorem it may be necessary to modify the given conjugacy class by 
exchange moves in order to jump from the given class to a new class (if it exists) which 
admits a destabilization. Thus there is an unknown complexity measure on conjugacy 
classes, with the ones which admit destabilizations being especially nice. This leads us to 
our second open problem. 

Open Problem 21 Develop an algorithm that will detect when the conjugacy class of 
a closed braid admits an exchange move, and when a collection of classes are exchange- 
equivalent. 4k 

This would still not be a complete tool, because exchange moves can be either be 
complexity-reducing or complexity-increasing, and unless we can tell the difference (that is 
the content of open problem 3 below) we are left with the option of trying a sequence of 
exchange moves of unpredictable length in our search for destabilizations. 

Open Problem 22 The complexity measure that was introduced in ^3.1l is 'hidden' in the 
braid foliation of an incompressible surface whose boundary is the given knot. This is a 
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highly non-trivial matter, because if we had the foliated surface in hand, then we would be 
able to compute its Euler characteristic and would know whether we had the unknot. On 
the other hand, the braid foliation determines the embedding of its boundary (see Theorem 
4.1 of so there is no essential obstacle to 'seeing' the complexity measure in the 

given braid. That is the essence of our third open problem, which asks that we recognize 
how to translate the complexity measure that was used in the proof of Theorem [5] into a 
complexity measure on closed braids which will be able to distinguish exchange moves that 
reduce complexity from those which do not. Jt» 
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6 A potpourri of miscellaneous results 



This section is for leftovers — topics on which there have been interesting new discoveries 
which did not seem to fit well anywhere else. 

6.1 Centralizers of braids and roots of braids 

The mixed braid groups are defined in jSH] to be the braids which preserve a given partition 
of the puncture points on the disc. In [S3] Gonzales- Meneses and Wiest gave full descriptions 
of the centralizer of a braid in terms of semi-direct and direct products of mixed braid groups, 
and found sharp bounds on the number of generators of the centralizer of a braid. 

In a related paper [52] Gonzales- Meneses proved that, up to conjugacy, braids have 
unique roots. That is, if uo £ B n and if a, (3 G B n have the property a k = (3 k = u>, then 
{«} = {/?}• 

6.2 Singular braids, the singular braid monoid, and the desingularization 
map 

The singular braid monoid SB n is a monoid extension of the braid group B n . It was 
introduced in [7] and, simultaneously and independently, in [20], Its definition was suggested 
by the mathematics which revolved about Vassiliev invariants of knots and links. To define 
it, we need to describe a presentation for SB„, taken from [20]. There are three types of 
generators, which we call cr^cr" 1 and Tj, 1 < i < n — 1. Here the of 1 are to be thought 
of as the classical positive and negative elementary braids and the Tj are to be thought 
of as elementary singular braids. The braid T{ is obtained from Oi (see Figure l2H (ii)) by 
identifying strands i and i + 1 as they cross. Defining relations in SB n are: 

Vi^i 1 = &i 1(J i = 1) Cm = Ti<Ji, 

&i<Jj = CTjCTi, CTiTj=Tj(Ti, T{Tj=TjTi if \i — j\ > 2 
OiOjOi = OjCJiCJj, OiCFjTi = TiOiOj if \i — j\ = 1. 

The desingularization map is a homomorphism from SB n to the group ring ZB n of 
the braid group, defined by ip(af 1 ) = <x i 1 , V J ( r i) = °i ~ '■ Birman used this 

map to develop the relationship between Vassiliev invariants and quantum groups'. It was 
conjectured in [201 that the map ip is an embedding. After several proofs of special cases of 
the conjecture, it was settled by Luis Paris in the affirmative in 2003 |118j . 

During the 10 year interval after the introduction of the singular braid monoid, and 
before the proof of the embedding theorem, it came as quite a surprise when it was discovered 
that there was a new group on the scene-the singular braid group of [7J . To this day we are 
unsure of its significance, although its existence is unquestioned! It is most easily defined via 
generators and relations, starting with the presentation that we just gave for the singular 
braid monoid, and then adding one new generator f (to suggest that it behaves the way 
that the inverse of r ought to behave). Defining relations for GB„ are all the relations in 
SB n , plus ones satisfied by the new generator. The latter are 'monoid relations' which are 
the same as those in the singular braid monoid, but substituting f for r, and two additional 
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relations, namely tt = tt = 1. Pictorially, one has two types of singular crossings, and they 
annihilate one another. The main result in [7J is that SB n embeds in GB n . 

6.3 The Tits conjecture 

The Tits conjecture is very easy to state, in its simplest form. Consider the classical 
presentation of the braid group B n , i.e. the presentation (J2J). The Tits conjecture, in the 
special case of the braid group, is that the subgroup G of B n generated by the elements 
Tj = af has the presentation (Ti, . . . , T n _i | T{Tj = TjTi if \i — j\ > 2). A generalized 
version of the conjecture (the generalization relates to the arbitrary choices of the powers) 
was proved by Crisp and Paris in 2001, settling a question which had plagued the experts 
for many years: 

Theorem 19 Let S be a finite set, let T be a Coxeter graph over S, and let A(T) be 
the associated Artin group, as defined in §5.4\ with generating set {a s , s E S}. Associate 
further to each s £ S an integer m s > 2. Consider the subgroup G of A(T) generated by 
the elements {T s>ms = cr™ s }. Then defining relations among the generators of the subgroup 
G are 'the obvious ones', namely that T StTng and Tt )Tnt commute in G if and only if they 
commute in A(T). No other relations are needed. 

The proof is interesting, because it introduces a technique which is very closely related 
to the themes that we have explored in this article. The basic idea is that, for a key subclass 
of Artin groups there is a representation / of A(T) into the mapping class group Ai(S) of 
a connected surface S with boundary that is associated to the graph T, which induces an 
action of A(T) on a monoid determined by T. Since the group H(T) which is presented 
in the theorem has an obvious homomorphism onto A(T), and the proof shows that the 
restriction of the action to H(T) gives the desired isomorphism. 

6.4 Braid groups are torsion-free: a new proof 

As we stated in § it was necessary to make choices in the writing of this review, and 
our decision was to be guided by the principle of focusing on new results or new proofs of 
known results. During the years since JH] was written, many people have written to the first 
author with questions about braids, and a regular question has been "Isn't there a simple 
proof that the braid groups have no elements of finite order?" It is therefore fitting that we 
end this review with exactly that - a beautiful simple proof, based upon the discovery, due 
to Dehornoy, that the braid groups admit a left-invariant ordering: 

Theorem 20 The groups B n , n = 1,2,3,... are all torsion free. 

Proof: See M5.7l for a discussion of the Dehornoy left-invariant ordering of the braid groups. 
Choose any element g E B n , g ^ 1. Replacing g if necessary with g _1 we may assume that 
1 < g. Since the ordering is left-invariant we then have that g < g 2 and g~ l < 1. Iterating, 

• • • < 9~ 3 < g~ 2 < g' 1 < i < 9 < g 2 < a 3 < ■ ■ ■■ II 
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Appendix: Computer programs 



In 2004, it is almost as important to know about computer tools as it is to have a guide 
to the literature, so we supplement our bibliography with a guide to the computer tools 
that we know about and which have been useful to us and colleagues. 



Changing knots and links to closed braids. Vogel's proof of his method for changing 
arbitrary knot diagrams to closed braid diagrams is ideal for computer programming. We 
refer the reader to the URL http://www.layer8.co.uk/maths/braids/, for a program, due 
to Andrew Bartholomew and Roger Fenn, which does this and much more. 



Garside's algorithm for the word and conjugacy problems. Many people have 
programmed Garside's algorithm for the word and conjugacy problem, but the one we 
know best is the program of Juan Gonzalez-Meneses, which can be downloaded from 
http:www.personal.us.es/meneses. The very robust version that the reader will find there 
computes Garside's normal forms for braids, and the ultra-summit set of a braid, a complete 
invariant of conjugacy. 

Nielsen-Thurston classification of mapping classes in A^o.n+i- We know of two 
very useful computer programs, all based upon the Bestvina-Handel algorithm ^2]- Both 
assume that the boundary of the n-times punctured disc has been capped with a disc, so 
that they compute in the mapping class group of B ra / modulo its center, rather than in 
B n . Equivalently, they work with the mapping class group of an (n + l)-times punctured 
sphere, where admissible maps fix the distinguished point. 

The first, due to W. Menasco and J. Ringland, can be downloaded from 



http://orange.math.buffalo.edu/software.html by following the link to "BH2.1 An Imple- 



mentation of the Bestvina-Handel Algorithm" . The second, due to T. Hall, can be down- 
loaded from http:/ /www. liv.ac.uk/Maths/pure and following the links to the research group 
on Dynamical Systems, and then to the home page of Toby Hall. Both determine whether 
an input map is pseudo-Anosov, reducible or finite order and both find an invariant train 
track and a train track map (which makes it possible to calculate essential dynamics in 
the class by Markov partition techniques). In the reducible case Hall's program provides a 
set of reducing curves. Recently Hall updated his program to adapt it to Macintosh OS-X 
computers, versions 10.2 and above. 



Other software. We mention M. Thistlethwaite's Knotscape, because the program has 
a reputation for being very versatile and user-friendly. It is available for download from 
http: //www. math. utk.edu~morwen It accepts as input knots that are defined as closed 
braids (and also knots defined by the Dowker code or mouse-drawn diagrams), and locates it 
in the tables if it has at most 16 crossings. The program computes numerous invariants, in- 
cluding the Alexander, Jones, Homfly and Kauffman polynomials, and hyperbolic invariants 
(assuming that the knot is hyperbolic). The hyperbolic routines in Knotscape were taken 
with permission from Jeff Weeks's program SnapPea; the procedures for calculating poly- 
nomials were supplied by Bruce Ewing and Ken Millett, and the procedure for producing a 
knot picture from Dowker code is part of Ken Stephenson's Circlepack program. 
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